METHODS AND COMPOUNDS FOR MODULATING MELANOCORTIN RECEPTOR-LIGAND BINDING

INTRODUCTION 
Technical Field 

The present invention relates to methods and compounds for modulating melanocortin 
receptor-ligand binding. 

Background

Recent biochemical investigations have identified agouti related protein ("AGRP") as
playing a major role in the regulation of mammalian feeding behavior. The human AGRP is a
132 residue polypeptide (SEQ ID NO:1) that is a naturally occurring competitive antagonist of
melanocortin receptors 3 and 4 ("MC3r" and "MC4r"). Overexpression of AGRP results in
adult onset obesity and diabetes in mice (Shutter, et al., Genes Dev. 11:593-602 (1997); Huszar,
et al., Cell 88:131-141 (1997); Hahn, et al., Nature Neurosci. 1:271-272 (1997)). AGRP binding
to MC4r in particular is the subject of intense interest since knockout mice that do not express
MC4r exhibit the same phenotype as caused by overexpression of AGRP (Huszar, et al., supra).
There is also evidence for the parallel expression of AGRP and neuropeptide Y in the arcuate
nucleus of the hypothalamus, with neuropeptide Y known to stimulate feeding (Hahn, et al.,
supra). This region of the brain also expresses MC4r and is involved in energy homeostasis.
Research has also focused on other melanocortin receptors, their antagonists and methods for
modulating receptor activity. See, for example, Wei, et al., WO9943709.

The growing body of evidence linking AGRP to weight control has yet to elucidate its
exact mechanism of action. However, studies on AGRP do benefit from analogy to the much
more widely studied agouti protein, as AGRP was originally identified through the homology of
its C-terminal region with the same region of the agouti protein (Shutter, et al., supra). The
agouti protein has been a focal point in obesity research for a number of years, since ectopic
expression of the wild-type protein in mice leads to obesity and related disorders, i.e., the same
symptoms as the overexpression of the more recently identified AGRP (Klebig, et al., Proc.
Natl. Acad. Sci. USA 92:4728-4732 (1995); Michaud, et al., J. Endocrinol. 155:207-209 (1997)).
However, unlike AGRP, agouti has distinct expression patterns in mice and humans, making in
vivo work with mice less applicable to human obesity disorders. AGRP, like agouti, is selective
for MC3r and MC4r but has approximately 100-fold greater binding affinity than agouti at these
receptors (Fong, et al., Biochem. Biophys. Res. Commun. 237:629-631 (1997)).

While full length agouti and AGRP are only 25% homologous, in their 46 residue Cys-rich
C-terminal regions, nine of the 10 Cys residues are spatially conserved and there are a
further 10 identical residues, giving ~40% sequence identity. Three consecutive, conserved
residues RFF (111-113 in human AGRP) were determined to be essential to the biological
activity of both agouti (Kiefer, et al., Biochemistry 36:2084-2090 (1997); Kiefer, et al.,
Biochemistry 37:991-997 (1998)) and AGRP (Tota, et al., Biochemistry 38:897-904 (1999)).
Two recent investigations have shown that the chemically synthesized C-terminal region of
AGRP competitively antagonizes α-melanocyte stimulating hormone (α-MSH) at melanocortin
receptors with equal or greater potency than the full proteins (Quillan, et al., FEBS Lett.
428:59-62 (1998); Yang, et al., Mol. Endocrinol. 13:148-155 (1999)), consistent with similar
findings for agouti (Willard, et al., Biochemistry 34:12341-12346 (1995)). Thus the Cys-rich
C-terminal region of AGRP, referred to as minimized agouti related protein ("MARP"), is a
prime candidate for structural studies:

CVRLH5 ESCLG10 QQVPC15 CDPCA20 TCYCR25
FFNAF30 CYCRK35 LGTAM40 NPCSR45 T (SEQ ID NO:2)
The covalent structure of MARP exhibits five disulfide bonds, which exist between the
following ten Cys residues (Bures, et al., Biochemistry 37:12172-12177 (1998)): Cys1 and Cys16;
Cys8 and Cys22; Cys15 and Cys33; Cys19 and Cys43; Cys24 and Cys31.
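
The sequence and disulfide map given above can be cross-checked mechanically. The following
is a minimal sketch, not part of the disclosed methods; it assumes the SEQ ID NO:2 sequence
reads as transcribed in the code, and the function names are illustrative only.

```python
# MARP sequence as transcribed from SEQ ID NO:2 (46 residues, 1-based numbering).
MARP = "CVRLHESCLGQQVPCCDPCATCYCRFFNAFCYCRKLGTAMNPCSRT"

# Disulfide pairs as listed in the text (MARP numbering).
DISULFIDES = [(1, 16), (8, 22), (15, 33), (19, 43), (24, 31)]


def cys_positions(seq):
    """Return the 1-based positions of every Cys residue in seq."""
    return [i for i, aa in enumerate(seq, start=1) if aa == "C"]


def check_disulfide_map(seq, pairs):
    """True if the paired positions are exactly the Cys positions, each used once."""
    paired = sorted(pos for pair in pairs for pos in pair)
    return paired == cys_positions(seq)


if __name__ == "__main__":
    print(len(MARP))
    print(cys_positions(MARP))
    print(check_disulfide_map(MARP, DISULFIDES))
```

Under these assumptions, every one of the ten Cys residues participates in exactly one of the
five listed bonds.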

Despite the important biological activities of AGRP, no experimental 3D structure has
been available for this protein. The inhibitor cystine knot ("ICK") family of proteins is also
disulfide-rich, and the structures of these invertebrate toxins have been used to suggest possible
structures for the agouti and AGRP C-terminal regions (Kiefer, et al., Biochemistry 37:991-997
(1998); Tota, et al., supra). Indeed, the recently reported disulfide map for AGRP and a
construct containing the C-terminal domain demonstrate ICK-like pairings (Norton, et al.,
Toxicon 36:1573-1583 (1998)) of the 10 Cys residues: 1-16, 8-22, 15-33, 19-43, 24-31 (using
MARP numbering, Bures, et al., supra). Beyond such modeling, the only structural data
published on either agouti or AGRP are circular dichroism (CD) spectra which suggest that both
proteins have little regular secondary structure, although there may be some β-sheet structure,
consistent with ICK structural characteristics (Willard, et al., supra; Rosenfeld, et al.,
Biochemistry 37:16041-16052 (1998)).

Accordingly, there exists a need for a better understanding of the structure of AGRP and
related proteins such as MARP. More particularly, a need exists for the identification and
characterization of the receptor binding regions of AGRP, MARP and other ligands for
melanocortin receptors such as MC3r and MC4r, and molecules that affect their interaction with
agouti and AGRP proteins. This would provide a major new target for iterative drug design,
synthesis, and selection. It also would be advantageous to devise methods and compositions for
reducing the time required to discover compounds that target the ligand binding site of
melanocortin receptors and administer them to organisms, and mammals in particular, to
modulate physiological processes regulated by melanocortin receptors.

The instant invention meets that need in describing the 3D structure in solution of the
human AGRP Cys-rich C-terminal region.

SUMMARY OF THE INVENTION 
The present invention relates to the receptor binding region of the human agouti related
protein ("AGRP"), which binds to melanocortin receptors 3 and 4 ("MC3r" and "MC4r").
Information about this region permits design of compounds that bind to the ligand binding site of
melanocortin receptors and modulate ligand binding to the receptor. The compounds include
agonists and antagonists that modulate melanocortin receptor activity by promoting (agonists) or
blocking (antagonists) ligand binding to the receptor, particularly antagonists. The compounds
of the invention can be receptor-, cell- and/or tissue-specific.

The present invention also includes an NMR structure of the human AGRP C-terminus
("MARP"), which binds to the ligand binding site of melanocortin receptors. The NMR
structure provides a means to obtain atomic modeling information of the specific amino acids
and their atoms forming the receptor binding region and that interact with molecules located at
the ligand binding site.

The present invention further provides methods for identifying and designing small
molecules that bind to the ligand binding site using atomic models of MARP. The method
involves modeling test compounds that mimic or match the 3D conformation of MARP and
therefore are expected to fit spatially into a melanocortin receptor ligand binding site of interest,
using an atomic structural model comprising the MARP receptor binding region or portion
thereof, screening the test compounds in a biological assay characterized by binding of a test
compound to a melanocortin receptor ligand binding site, and identifying a test compound that
modulates ligand binding to the melanocortin receptor.

The invention also includes compositions and methods for identifying ligand binding
sites of melanocortin receptors. The methods involve examining the MARP surface to identify
residues that modulate ligand binding. The residues can be identified by homology to the
receptor binding region of MARP, as described herein. Overlays and superpositioning with a
three dimensional model of the MARP receptor binding region, or a portion thereof that contains
a receptor binding region, also can be used for this purpose. Additionally, alignment and/or
modeling can be used as a guide for the placement of mutations on the receptor binding region
surface to characterize the nature of the ligand binding site on melanocortin receptors in the
context of a cell.

Also provided is a method of modulating the activity of a melanocortin receptor. The
method can be in vitro or in vivo. The method comprises administering, in vitro or in vivo, a
sufficient amount of a compound that binds to the ligand binding site. Preferred compounds
bind to the site with greater affinity than ligand proteins found in a cell of interest.

The invention further includes a method for identifying an agonist or antagonist of ligand
binding to a melanocortin receptor. The method comprises providing the atomic coordinates
comprising a melanocortin receptor binding region or portion thereof to a computerized
modeling system; modeling compounds which match or mimic the receptor binding region and
thus fit spatially into the melanocortin receptor ligand binding site; and identifying in an assay
for melanocortin receptor activity a compound that increases or decreases activity of the
melanocortin receptor through binding the ligand binding site. The melanocortin receptor
binding region is preferably the MARP receptor binding region or portion thereof.

Also provided is a machine-readable data storage medium with information for
constructing and manipulating an atomic model comprising a receptor binding region or portion
thereof. The medium comprises a data storage material encoded with machine readable data
which, when using a machine programmed with instructions for using said data, is capable of
displaying a graphical three-dimensional representation of a molecule that binds a melanocortin
receptor.

Also provided is a method of identifying a compound that selectively modulates the
activity of one type of melanocortin receptor compared to other melanocortin receptors. The
method is exemplified by modeling test compounds that fit spatially and preferentially into a
melanocortin receptor ligand binding site of interest using an atomic structural model of the
MARP receptor binding region, selecting a compound that mimics or matches one or more
residues of the receptor binding region unique in the context of that region, and identifying in an
assay for ligand binding activity a compound that selectively binds to the ligand binding site
compared to other melanocortin receptors.

The invention finds use in the selection and characterization of peptide, peptidomimetic,
as well as other small molecule compounds, such as small organic molecules, identified by the
methods of the invention, particularly new lead compounds useful in treating melanocortin
receptor-based disorders. The invention also includes a group of polypeptides or peptide-related
compounds comprised of sequences from MARP. Members of this group may be made
synthetically and can be modified in myriad ways.

The invention includes a method of treating a disease state in mammals by treatment with
a polypeptide comprised of sequences from MARP. A preferred embodiment would be a
method of treating a wasting syndrome, such as HIV wasting syndrome, cachexia, or anorexia.

The invention further includes a method of screening for a compound that is capable of
inhibiting binding of a melanocortin agonist or antagonist to a melanocortin receptor. This
method can be used to screen for compounds that inhibit binding at specific melanocortin
receptors such as MC3R or MC4R. Further, this method can be used to identify compounds that
interfere with either agonist or antagonist binding at the melanocortin receptor. Examples of
antagonists that interfere with agonist binding include, but are not limited to, MARP and AGRP.

The invention also includes methods of screening for binding of a compound to a
melanocortin receptor. Such methods can be used to screen for compounds that bind specific
melanocortin receptors such as MC3R and MC4R, and can be used for high throughput
screening.

The invention also pertains to a polypeptide molecule that acts as a strong antagonist of 
melanocortin receptor types 3 and 4, and is useful for the treatment of eating disorders and 
obesity. 

DESCRIPTION OF THE DRAWINGS

Fig. 1 shows the Cα backbone of the MARP minimized average structure. The N-terminal
loop, central loop and C-terminal loop are indicated. Disulfide bonds are represented
by dashed lines. Spheres represent residues with amides protected from hydrogen exchange (HX)
for more than 12 hours, more than 24 hours and more than 8 days.

Fig. 2 depicts the heavy atom (non-hydrogen) backbone representation of MARP for 14
NMR structures with residues 1-34 fit to the minimized average structure (Root Mean Square
Deviation, "RMSD" 1.49 Å). Only the minimized average structure (thick cylinder) is shown for
the more disordered C-terminal loop.


WO 01/30808 PCT/US99/25201 

Fig. 3 shows the backbone atoms for residues 24-31 of the family of 20 structures with
residues 24-31 fit to the minimized average structure. The side chain heavy atoms of residues
25, 26 and 27, essential for activity, are shown.

DESCRIPTION OF SPECIFIC EMBODIMENTS 
The agouti related protein ("AGRP") is a mammalian signaling molecule, involved in
weight homeostasis, that causes adult onset obesity when overexpressed in mice. AGRP was
originally identified by homology to the agouti protein, another potent signaling molecule
involved in obesity disorders in mice. While AGRP's exact mechanism of action is unknown, it
has been identified as a competitive antagonist of melanocortin receptors 3 and 4 ("MC3r" and
"MC4r"). MC4r in particular is implicated in the hypothalamic control of feeding behavior. Full
length agouti and AGRP are only 25% homologous; however, their active C-terminal regions are
~40% homologous, with nine out of the 10 Cys residues spatially conserved. Until now, 3D
structures have not been available for agouti, AGRP or their C-terminal regions.

This invention relates to the 3D NMR structure in solution of the human AGRP Cys-rich
C-terminal region as determined by 1H NMR using a protein prepared by total chemical
synthesis. As used herein the term "the NMR structure" is understood to refer to the minimized
average of the family of NMR structures. Because biochemical investigations demonstrate that
this minimal region retains full biological activity, this protein is referred to herein as minimized
agouti related protein ("MARP"). MARP residues 1-46 (SEQ ID NO:2) correspond to human
AGRP residues 87-132 (residues 87-132 of SEQ ID NO:1). Thus, human AGRP numbering is
obtained by adding 86 to MARP numbering.
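
The numbering relationship stated above can be written as a small helper. This is an
illustrative sketch only; the function names and the range checks are assumptions, not part
of the specification.

```python
AGRP_OFFSET = 86          # from the text: AGRP number = MARP number + 86
MARP_RANGE = (1, 46)      # MARP residues, SEQ ID NO:2
AGRP_RANGE = (87, 132)    # corresponding residues of human AGRP, SEQ ID NO:1


def marp_to_agrp(n):
    """Convert a MARP residue number to human AGRP numbering."""
    if not MARP_RANGE[0] <= n <= MARP_RANGE[1]:
        raise ValueError(f"MARP residue {n} is outside 1-46")
    return n + AGRP_OFFSET


def agrp_to_marp(n):
    """Convert a human AGRP residue number (87-132) to MARP numbering."""
    if not AGRP_RANGE[0] <= n <= AGRP_RANGE[1]:
        raise ValueError(f"AGRP residue {n} is outside 87-132")
    return n - AGRP_OFFSET


if __name__ == "__main__":
    # The RFF triplet, residues 25-27 in MARP numbering.
    print([marp_to_agrp(n) for n in (25, 26, 27)])
```

For example, the RFF triplet at MARP residues 25-27 maps to residues 111-113 of human AGRP.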

MARP's topology is characterized by three large loops (referred to herein as the N-terminal
loop, the central loop and the C-terminal loop), with four of the five disulfide bridges at
the base of the structure, and an absence of canonical secondary structure such as helices or
sheets. Two of the three loops are structurally well characterized by the NMR data as indicated
by low RMSDs. The region of MARP containing the RFF triplet (Tota, et al., supra) (residues
25-27 in MARP) necessary for function is located in one of the best defined regions of the
protein. While previously reported structural models of the C-terminal region of AGRP were
attempted based on Cys homology between AGRP and certain toxin proteins, Cys spacing is not
sufficient to correctly determine the 3D fold of the molecule. Despite the conservation of Cys
spacing and the disulfide map between MARP and other small disulfide-rich proteins from the
ICK family, it is apparent from the structure described herein that MARP does not adopt an
ICK-like fold.

The 3D structure of MARP presents a basis for the development of methods and
compositions for identifying compounds that modulate melanocortin receptor activity, in
particular the activity of MC3r and MC4r. In a preferred embodiment, the 3D structure presents
a basis for compounds that preferentially modulate MC4r activity. The compounds can be
melanocortin receptor agonists or antagonists that bind to the ligand binding site (and that act as
mimetics to the ligand in this regard), and promote (agonists) or block (antagonists) binding of
the ligand to the target melanocortin receptor. Compounds that bind to the ligand binding site
also are provided. The compounds can be natural or synthetic. Preferred compounds are small
organic molecules, peptides and peptidomimetics (e.g., cyclic peptides, peptide analogs, or
constrained peptides).

Accordingly, one aspect of the invention involves methods for identifying and designing
small molecules that bind to the ligand binding site using atomic models of MARP. In
particular, the invention provides for a method of identifying a compound that modulates ligand
binding to a melanocortin receptor. The method involves modeling test compounds that mimic
or match the 3D conformation of MARP and therefore are expected to fit spatially into a
melanocortin receptor ligand binding site of interest, using an atomic structural model of a
melanocortin receptor binding region or portion thereof, preferably comprising the MARP
receptor binding region or portion thereof. A test compound can fit spatially into the ligand
binding site of interest based upon a geometric fit of its three-dimensional structure or based
upon the spatial arrangement of atoms presenting specific chemical properties such as charge
and hydrophobicity. The test compounds are then screened in an assay, such as a biological
assay, characterized by binding of a test compound to a melanocortin receptor ligand binding
site, and a test compound that modulates ligand binding to the melanocortin receptor is identified.
The atomic structural model is described in detail below.

As noted above, the 3D structure of MARP is characterized by three loops held together
at the base by an apparent scaffold of four disulfide bonds: 1-16, 8-22, 15-33 and 19-43. The
fifth disulfide bond, 24-31, further stabilizes the base of the active loop which presents the RFF
triplet on the protein surface. There is no identifiable canonical helical or sheet structure. It is
clear from biochemical data that the RFF triplet is critical for the activity of MARP as a
competitive antagonist of α-MSH-stimulated activation of MC4r signaling. The structure
described herein shows that MARP is structured to present the side chains of the RFF triplet on
the surface of the protein and to the surrounding solvent. Recent work demonstrates that MARP
is much more active than smaller AGRP derived peptides containing the RFF triplet (Tota, et al.,
Biochemistry 38:897-904 (1999)). Thus, the detailed fold of the central loop and perhaps the
presence of the N- and C-terminal loops are critical for AGRP function. In addition, based upon
work with chimeras of melanocortin receptors (I. Gantz, unpublished data), it is possible that the
N- and C-terminal loops may confer receptor subtype specificity.

The previous absence of structural data on both AGRP and agouti encouraged the
modeling of the C-terminal regions of these proteins onto the ICK family (Norton, et al., supra),
which is characterized by homologous Cys spacing (Kiefer, et al., Biochemistry 37:991-997
(1998); Tota, et al., supra). The ICK family of proteins primarily consists of small (<60
residues) disulfide-rich (three or four disulfides) toxin proteins from the venom of spiders and
cone snails, which function as ion channel antagonists (Norton, et al., supra). The coincidence
between the function of the majority of these toxins and the recent description of part of the
agouti protein's mechanism of action being calcium dependent (Kim, et al., FASEB J. 10:1646-1652
(1996); Kim, et al., Am. J. Physiol. 272:E379-384 (1997); Jones, et al., Am. J. Physiol.
270:E192-196 (1996)) further encouraged these homology modeling efforts. The ICK motif in
particular is characterized by the topology of the three disulfide bonds corresponding to 1-16,
8-22 and 15-33 in MARP. In the ICK motif the first two disulfide bonds with their intervening
main chain atoms form a topological circle through which the third disulfide bond passes,
forming the cystine knot (Norton, et al., supra). The motif is further characterized by the
identification of an irregular triple stranded antiparallel β-sheet, roughly corresponding to
residues 6-8, 20-24 and 31-34 in MARP. The remaining two disulfide bonds in MARP each
occur in individual ICK proteins as separate examples of potential "non-motif" disulfide bonds,
although no examples of ICK motif proteins with five disulfide bonds have been observed.

Despite these apparent similarities, the experimental structure of MARP shows that this
protein does not satisfy the criteria required for inclusion in the ICK family. While the first two
disulfide bonds in MARP, 1-16 and 8-22, together with the polypeptide backbone form a
topological circle, none of the remaining disulfides passes through the circle to form a cystine
knot. Instead, disulfide bond 15-33 is positioned adjacent to the circle with all of the fold on one
side of this circle. In addition, MARP lacks the β-sheet found in ICK family proteins. The
experimental determination of the distinctive 3D structure of MARP described herein suggests
that Cys spacing and even the disulfide map of small Cys-rich proteins may not always be
sufficient to accurately predict protein folds. These results speak to the potential limitations of
"homology modeling" of protein structures, and may have important implications for the
emerging field of genomic structural biology. Use of atomic modeling, alone or in combination
with homology modeling, may serve to remedy this.

As described in the Examples, ligand binding studies and analysis of atomic models
derived from the MARP NMR structure reveal for the first time a previously unknown structure
for MARP and its receptor binding region. By "receptor binding region" is intended a structural
segment or segments of melanocortin receptor ligands, and MARP in particular, folded in such a
way so as to give the proper geometry and amino acid residue conformation for binding to a
melanocortin receptor. By "ligand binding site" is intended a structural segment or segments of
melanocortin receptor polypeptide chain folded in such a way so as to give the proper geometry
and amino acid residue conformation for binding a ligand. These are the physical arrangements
of protein atoms in three-dimensional space forming a receptor binding region or a ligand
binding site pocket or cavity.

The MARP structure has three major loops: the N-terminal loop, residues 1-18 (residues
1-18 of SEQ ID NO:2), the central loop, residues 19-34 (residues 19-34 of SEQ ID NO:2) and
the C-terminal loop, residues 35-46 (residues 35-46 of SEQ ID NO:2). Residues forming the
receptor binding region are amino acids corresponding to (i.e., the same as or equivalent to)
residues 24-31 of the central loop (residues 24-31 of SEQ ID NO:2), referred to herein as the
"active" loop. In particular, residues 25, 26 and 27 (the "RFF" triplet) of the active loop are
critical for activity. It has been found that a polypeptide comprising the central loop, preferably
with at least a portion of the N-terminal loop, is desirable for optimal biological activity. As used
herein, the term "at least a portion of the N-terminal loop" is intended to mean a sequence that
corresponds to (i.e., the same as or equivalent to) at least residues 15 to 18 of the N-terminal
loop (residues 15-18 of SEQ ID NO:2), preferably at least residues 8 to 18 of the N-terminal
loop (residues 8-18 of SEQ ID NO:2), and is also intended to include all of the N-terminal loop,
i.e., residues 1 to 18 of the N-terminal loop (residues 1-18 of SEQ ID NO:2). The term is also
intended to mean a sequence that corresponds to at least 20%, preferably at least 60%, and more
preferably at least 90%, of the contiguous or non-contiguous amino acid residues or their atoms
selected from amino acid residues 1 to 18 of the N-terminal loop (residues 1-18 of SEQ ID
NO:2).
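
The loop boundaries and the "at least a portion of the N-terminal loop" thresholds described
above can be illustrated with a short sketch. It assumes the SEQ ID NO:2 sequence as
transcribed in the code; the helper names are hypothetical and chosen only for illustration.

```python
# MARP sequence as transcribed from SEQ ID NO:2 (1-based numbering).
MARP = "CVRLHESCLGQQVPCCDPCATCYCRFFNAFCYCRKLGTAMNPCSRT"

# Loop boundaries as defined in the text (inclusive, MARP numbering).
LOOPS = {"N-terminal": (1, 18), "central": (19, 34), "C-terminal": (35, 46)}
ACTIVE_LOOP = (24, 31)
RFF_TRIPLET = (25, 27)


def segment(seq, start, end):
    """Return residues start..end (inclusive, 1-based) of seq."""
    return seq[start - 1:end]


def loop_of(position):
    """Name the loop that contains a given MARP residue number."""
    for name, (start, end) in LOOPS.items():
        if start <= position <= end:
            return name
    raise ValueError(f"residue {position} is outside 1-46")


def n_terminal_fraction(positions):
    """Fraction of N-terminal loop residues (1-18) covered by the given positions."""
    start, end = LOOPS["N-terminal"]
    covered = {p for p in positions if start <= p <= end}
    return len(covered) / (end - start + 1)


if __name__ == "__main__":
    print(segment(MARP, *ACTIVE_LOOP))
    print(segment(MARP, *RFF_TRIPLET))
    print(n_terminal_fraction(range(15, 19)), n_terminal_fraction(range(8, 19)))
```

Under these definitions, residues 15-18 cover 4 of 18 N-terminal loop residues (above the 20%
level) and residues 8-18 cover 11 of 18 (above the 60% level).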

The invention also includes compositions and methods for identifying receptor binding
regions of melanocortin receptor agonists and antagonists, along with ligand binding sites of
melanocortin receptors. The methods involve examining the surface of a polypeptide of interest
to identify residues that modulate ligand binding. The residues can be identified by homology to
the receptor binding region of MARP described herein. A preferred method is alignment with
the residues of any polypeptide corresponding to (i.e., the same as or equivalent to) residues 1-18
of the N-terminal loop (residues 1-18 of SEQ ID NO:2), residues 19-34 of the central loop
(residues 19-34 of SEQ ID NO:2) and residues 35-46 of the C-terminal loop (residues 35-46 of
SEQ ID NO:2). Overlays and superpositioning with a three-dimensional model of the MARP
receptor binding region, or a portion thereof that contains a receptor binding region, also can be
used for this purpose. For example, melanocortin receptor agonists and antagonists identifiable
by homology alignment include naturally occurring compounds or compounds structurally
related to such naturally occurring compounds found in humans, along with synthetic
compounds.

Alignment and/or modeling also can be used as a guide for the placement of mutations
on the receptor binding region surface to characterize the nature of the ligand binding site on
melanocortin receptors in the context of a cell. To destroy the ligand binding interaction,
preferred mutations are to charged residues (e.g., Arg, Lys, or Glu) on the basis that bulky,
surface charged residues might disrupt ligand binding, yet preserve the overall ligand structure
and solubility. Mutants can be tested for ligand binding as well as the relative change in strength
of the binding interaction. Ligand-dependent ligand interaction assays also can be used for this
purpose, such as those described herein.

Compounds that bind to the ligand binding site of melanocortin receptors can be
identified by computational modeling and/or screening. For example, ligand agonists or
antagonists can be identified by providing atomic coordinates comprising the MARP receptor
binding region or portion thereof to a computerized modeling system, modeling them, and
identifying compounds that mimic or match the receptor binding region and thus would be
expected to fit spatially into the ligand binding site. By a "portion thereof" is intended the
atomic coordinates corresponding to a sufficient number of residues or their atoms of the
receptor binding region that interact with a melanocortin receptor capable of binding the region.
As another example, an atomic structural model utilized for computational modeling and/or
screening of compounds that mimic or match the receptor binding region and thus would be
expected to fit spatially into the ligand binding site may include a portion of atomic coordinates
of amino acid residues corresponding to the region composed of residues 24-31 of the central
loop (residues 24-31 of SEQ ID NO:2), or their structural and functional equivalents. An atomic
model can also be designed that includes residues 19-34 of the central loop (residues 19-34 of
SEQ ID NO:2), and a model can further be designed that includes residues 19-34 of the central
loop (residues 19-34 of SEQ ID NO:2) and some or all of residues 1-18 of the N-terminal loop
(residues 1-18 of SEQ ID NO:2). Thus, for example, the atomic coordinates provided to the
modeling system can contain atoms of MARP, all or part of the receptor binding region or a
subset of atoms useful in the modeling and design of compounds that mimic or match the
receptor binding region.
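
Selecting such a subset of atomic coordinates can be sketched as a simple filter by residue
number. The example below is illustrative only: the `Atom` record, the `select_atoms` helper
and the `DEMO_ATOMS` list are hypothetical, and the coordinates are placeholder values, not
data from the MARP NMR structure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Atom:
    res_num: int    # MARP residue number (SEQ ID NO:2 numbering)
    res_name: str   # three-letter residue name
    name: str       # atom name, e.g. "CA"
    x: float
    y: float
    z: float


BACKBONE_NAMES = {"N", "CA", "C", "O"}


def select_atoms(atoms, start, end, backbone_only=False):
    """Keep atoms whose residue number lies in start..end (inclusive)."""
    return [
        a for a in atoms
        if start <= a.res_num <= end
        and (not backbone_only or a.name in BACKBONE_NAMES)
    ]


# Placeholder coordinates for illustration only; NOT experimental values.
DEMO_ATOMS = [
    Atom(23, "TYR", "CA", 0.0, 0.0, 0.0),
    Atom(24, "CYS", "CA", 1.0, 0.0, 0.0),
    Atom(25, "ARG", "CA", 2.0, 0.0, 0.0),
    Atom(25, "ARG", "CZ", 2.0, 1.0, 0.0),
    Atom(31, "CYS", "CA", 3.0, 0.0, 0.0),
    Atom(35, "LYS", "CA", 4.0, 0.0, 0.0),
]

if __name__ == "__main__":
    active_loop = select_atoms(DEMO_ATOMS, 24, 31)           # residues 24-31
    central_loop = select_atoms(DEMO_ATOMS, 19, 34)          # residues 19-34
    backbone = select_atoms(DEMO_ATOMS, 24, 31, backbone_only=True)
    print(len(active_loop), len(central_loop), len(backbone))
```

The same filter could be applied with the residue ranges 1-18 plus 19-34 to build a model that
includes some or all of the N-terminal loop.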

The atomic coordinates of MARP and its receptor binding region can be used for
modeling to identify other compounds or fragments that bind melanocortin receptors. By
"modeling" is intended quantitative and qualitative analysis of molecular structure/function
based on atomic structural information and receptor-ligand agonist/antagonist interaction
models. This includes conventional numeric-based molecular dynamic and energy minimization
models, interactive computer graphic models, modified molecular mechanics models, distance
geometry and other structure-based constraint models. Modeling is preferably performed using
a computer and may be further optimized using known methods. By "fits spatially" is intended
that the three-dimensional structure of a compound is accommodated geometrically by a cavity
or pocket of a melanocortin receptor ligand binding site or by a pattern of charge or
hydrophobicity.

Compounds of particular interest fit spatially and preferentially into the ligand binding
site. By "fits spatially and preferentially" is intended that a compound possesses a
three-dimensional structure and conformation for selectively interacting with a melanocortin
receptor ligand binding site. Compounds that fit spatially and preferentially into the ligand
binding site interact with amino acid residues forming the ligand binding site. More specifically,
these compounds mimic or match the MARP receptor binding region. The present invention also
includes a method for identifying a compound capable of selectively modulating ligand binding
to different melanocortin receptors. The method comprises the steps of modeling test
compounds that fit spatially and preferentially into the ligand binding site of a melanocortin
receptor of interest using an atomic structural model of MARP, screening the test compounds
in a biological assay for melanocortin receptor activity characterized by preferential binding of a
test compound to the ligand binding site of a melanocortin receptor, and identifying a test
compound that selectively modulates the activity of a melanocortin receptor. Such
receptor-specific compounds are selected to exploit differences between the ligand binding
sites of one type of receptor versus a second type of receptor.

The invention also is applicable to generating new compounds that distinguish
melanocortin receptor isoforms. This can facilitate generation of either tissue-specific or
function-specific compounds.

The receptor-specific compounds of the invention preferably interact with
conformationally constrained residues of the ligand binding site that are conserved among one
type of receptor compared to a second type of receptor. "Conformationally constrained" is
intended to refer to the three-dimensional structure of a chemical or moiety thereof having
certain rotations about its bonds fixed by various local geometric and physical-chemical
constraints. Conformationally constrained structural features of a ligand binding site include
residues that have their natural flexible conformations fixed by various geometric and
physical-chemical constraints, such as local backbone, local side chain, and topological
constraints. These types of constraints are exploited to restrict positioning of atoms involved
in receptor-ligand recognition and binding.

As described in the Examples, residues 25, 26 and 27 of the MARP receptor binding
region are shown to be critical for activity. These three residues, along with the overall structure
and composition of the N-terminus and central loop, appear to be necessary for optimal
biological activity. Accordingly, modification of these residues can be exploited in the
identification and design of compounds that modulate ligand binding to one melanocortin
receptor compared to another.

For modeling, docking algorithms and computer programs that employ them can be used
to identify compounds that match or mimic the MARP receptor binding region. For example,
docking programs can be used to predict how a small molecule of interest can interact with the
melanocortin receptor ligand binding site. Fragment-based docking also can be used in building
molecules de novo inside the ligand binding site, by placing chemical fragments that
complement the site to optimize intermolecular interactions. The techniques can be used to
optimize the geometry of the binding interactions. This design approach has been made possible
by identification of the receptor binding region structure; thus, the principles of molecular
recognition can now be used to design a compound which matches the structure of this region.
Compounds that structurally match or mimic the MARP receptor binding region serve as a
starting point for an iterative design, synthesis and test cycle in which new compounds are
selected and optimized for desired properties including affinity, efficacy, and selectivity. For
example, the compounds can be subjected to additional modification, such as replacement and/or
addition of R-group substituents of a core structure identified for a particular class of binding
compounds, modeling and/or activity screening if desired, and then subjected to additional
rounds of testing.

Computationally, small molecule databases can be screened for chemical entities or
compounds that can bind in whole, or in part, to a melanocortin receptor ligand binding site of
interest. In this screening, the quality of fit of such entities or compounds to the binding site
may be judged either by shape complementarity (DesJarlais et al., J. Med. Chem. (1988) 31:722-
729) or by estimated interaction energy (Meng et al., J. Comp. Chem. (1992) 13:505-524). The
molecule databases include any virtual or physical database, such as electronic and physical
compound library databases, and are preferably used in developing compounds that modulate
ligand binding.
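By way of illustration only, the "estimated interaction energy" criterion can be sketched as a sum of pairwise 12-6 Lennard-Jones and Coulomb terms between ligand atoms and binding site atoms. The atom representation, the function names and the parameter values (well depth, optimal separation, dielectric constant) are assumptions made for this sketch and are not taken from the programs or references cited herein.

```python
import math

# Conventional conversion factor giving kcal/mol for charges in units of e
# and distances in Angstrom.
COULOMB_K = 332.06


def pair_energy(r, eps, rmin, qi, qj, dielectric=4.0):
    """12-6 Lennard-Jones plus Coulomb energy (kcal/mol) for one atom pair.

    r is the interatomic distance, eps the well depth, rmin the distance of
    the Lennard-Jones minimum, qi and qj the partial charges.
    """
    ratio6 = (rmin / r) ** 6
    lennard_jones = eps * (ratio6 * ratio6 - 2.0 * ratio6)
    coulomb = COULOMB_K * qi * qj / (dielectric * r)
    return lennard_jones + coulomb


def interaction_energy(ligand_atoms, site_atoms, eps=0.1, rmin=3.8):
    """Sum pair energies over every ligand/site atom pair.

    Each atom is a tuple (x, y, z, partial_charge). A single eps/rmin pair is
    used for all atom types, which is a deliberate simplification.
    """
    total = 0.0
    for x1, y1, z1, q1 in ligand_atoms:
        for x2, y2, z2, q2 in site_atoms:
            r = math.dist((x1, y1, z1), (x2, y2, z2))
            total += pair_energy(r, eps, rmin, q1, q2)
    return total
```

Candidate compounds could then be ranked by this value, lower energies suggesting a better estimated fit; a production scoring function would use atom-type-specific parameters.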

Compounds can be designed intelligently by exploiting available structural and
functional information by gaining an understanding of the quantitative structure-activity
relationship (QSAR), using that understanding to design new compound libraries, particularly
focused libraries having chemical diversity of one or more particular groups of a core structure,
and incorporating any structural data into that iterative design process. For example, one skilled
in the art may use one of several methods to screen chemical entities or fragments by comparing
them to the 3D structure of the AGRP C-terminus and thus assessing their ability to associate
with the ligand binding site of a melanocortin receptor of interest. This process may begin by
visual inspection of, for example, the receptor binding region on the computer screen. Selected
fragments or chemical entities may then be positioned into all or part of the region. Docking
may be accomplished using software such as Quanta and Sybyl, followed by energy
minimization and molecular dynamics with standard molecular mechanics force-fields, such as
CHARMM and AMBER.

Residues comprising a receptor binding region can be modeled to look for energetic
contributions and interaction with the bound chemical entity. For example, a compound or
fragment can be designed to contain hydrophobic groups that interact with hydrophobic residues
of the ligand binding site.

Specialized computer programs may also assist in the process of selecting chemical
entity fragments or whole compounds. These include: GRID (Goodford, J. Med. Chem. (1985)
28:849-857; available from Oxford University, Oxford, UK); MCSS (Miranker et al., Proteins:
Structure, Function and Genetics (1991) 11:29-34; available from Molecular Simulations,
Burlington, MA); AUTODOCK (Goodsell et al., Proteins: Structure, Function and Genetics
(1990) 8:195-202; available from Scripps Research Institute, La Jolla, CA); and DOCK (Kuntz
et al., J. Mol. Biol. (1982) 161:269-288; available from University of California, San Francisco,
CA).

Additional commercially available computer databases for small molecular compounds
include Cambridge Structural Database and Fine Chemical Database (Rusinko, Chem. Des.
Auto. News (1993) 8:44-47).

Once suitable chemical entities or fragments have been selected, they can be assembled
into a single compound. Assembly may be preceded by visual inspection of the relationship of
the fragments to each other on the three-dimensional image displayed on a computer screen in
relation to the structure coordinates of a melanocortin receptor. This can be followed by manual
model building using software such as Quanta or Sybyl.

Useful programs to aid one of skill in the art in connecting the individual chemical
entities or fragments include: CAVEAT (Bartlett et al., "CAVEAT: A Program to Facilitate the
Structure-Derived Design of Biologically Active Molecules", In: "Molecular Recognition in
Chemical and Biological Problems", Special Pub., Royal Chem. Soc. (1989) 78:182-196;
CAVEAT is available from the University of California, Berkeley, CA); 3D Database systems
such as MACCS-3D (MDL Information Systems, San Leandro, CA; reviewed in Martin, J. Med.
Chem. (1992) 35:2145-2154); and HOOK (available from Molecular Simulations, Burlington,
MA).

In addition to building a compound in a step-wise fashion, one fragment or chemical
entity at a time as described above, compounds that bind to a ligand binding site of interest also
may be designed as a whole or de novo using some portion(s) of the AGRP C-terminus, which is
a molecule known to bind to the site. These methods include: LUDI (Bohm, J. Comp. Aid.
Molec. Design (1992) 6:61-78; LUDI is available from Biosym Technologies, San Diego, CA);
LEGEND (Nishibata et al., Tetrahedron (1991) 47:8985; LEGEND is available from Molecular
Simulations, Burlington, MA); and LeapFrog (available from Tripos Associates, St. Louis, MO).

Other molecular modeling techniques may also be employed in accordance with this
invention. See, for example, Cohen et al., J. Med. Chem. (1990) 33:883-894; Navia et al., Curr.
Opin. Struct. Biol. (1992) 2:202-210. For example, where the structures of test compounds are
known, a model of the test compound may be superimposed over the model of the structure of
the invention. Numerous methods and techniques are known in the art for performing this step,
any of which may be used. See, for example, Farmer, "Drug Design", Ariens, E.J., ed., 10:119-
143 (Academic Press, New York, 1980); U.S. Patent No. 5,331,573; U.S. Patent No. 5,500,807;
Verlinde, Structure (1994) 2:577-587; and Kuntz et al., Science (1992) 257:1078-1082. The
model building techniques and computer evaluation systems described herein are not a limitation
on the present invention.

Using these computer modeling systems, a large number of compounds may be quickly
and easily examined, and expensive and lengthy biochemical testing avoided. Moreover, the
need for actual synthesis of many compounds can be substantially reduced and/or effectively
eliminated.

Compounds identified through modeling can be screened in an assay characterized by 
binding of the compound to a ligand binding site of interest for ligand binding activity, such as a 
biologically based assay. Screening can be in vitro and/or in vivo. Preferred assays include cell- 
free competition assays and cell culture based assays. The biological screening preferably 
centers on activity-based response models, binding assays (which measure how well a 
compound binds to the receptor), and bacterial, yeast and animal cell lines (which measure the 
biological effect of a compound in a cell). The assays can be automated for high-capacity,
high-throughput screening (HTS) in which large numbers of compounds can be tested to identify
compounds with the desired activity. 

As an example, in vitro binding assays can be performed in which compounds are tested
for their ability to block the binding of a ligand protein, fragment, fusion or peptide thereof, to a
ligand binding site of interest. Cell and tissue culture assays may be performed to
assess a compound's ability to block function of cellular ligands, such as competitively inhibiting
[Nle4, D-Phe7]α-MSH ("NDP-MSH"), as described in the examples and the references cited
therein, which are incorporated by reference.

In a preferred embodiment, compounds of the invention bind to a melanocortin receptor
ligand binding site with greater affinity than the cellular ligand proteins. Tissue profiling and
appropriate animal models also can be used to select compounds. Different cell types and
tissues also can be used for these biological screening assays. Suitable assays for such screening
are described herein and in the Examples and the references cited therein, which references are
incorporated herein in their entirety by reference. For example, ligands or binding fragments
thereof can be expressed and/or assayed for their ability to inhibit cAMP generation in cells
transfected with MC4r (see Example 1), as described in the examples and the references cited
therein, which are incorporated by reference.

The compounds selected can have agonist and/or antagonistic properties. The
compounds also include those that exhibit new properties with varying mixtures of agonist and
antagonist activities, depending on the effects of altering ligand binding in the context of
different activities of melanocortin receptors which are mediated by proteins other than ligands,
and which interact with the receptors at locations other than the ligand binding site. The
compounds also include those which, through their binding to receptor locations that are
conformationally sensitive to ligand binding, have allosteric effects on the receptor by stabilizing
or destabilizing the ligand-bound conformation of the receptor, or by directly inducing the same,
similar, or different conformational changes induced in the receptor by ligand binding.

Of particular interest is use of such compounds in a method of modulating melanocortin
receptor activity in a mammal by administering to a mammal in need thereof a sufficient amount
of a compound that fits spatially and preferentially into a ligand binding site of a melanocortin
receptor of interest. By "modulating" is intended increasing or decreasing activity of a
melanocortin receptor. For example, pre-clinical candidate compounds can be tested in
appropriate animal models in order to measure efficacy, absorption, pharmacokinetics and
toxicity following standard techniques known in the art. Compounds exhibiting desired
properties are then tested in clinical trials for use in treatment of various melanocortin receptor-
based disorders. These include feeding disorders, including wasting syndromes, obesity, and
other disorders related to hypothalamic control of feeding. A wasting syndrome is an illness
characterized by significant weight loss accompanied by other indicia of poor health, including
poor appetite, gut disorder, or increased metabolic rate. Wasting syndromes include, but are not
limited to, the wasting syndrome afflicting some patients diagnosed with Acquired Immune
Deficiency Syndrome (AIDS) and various cancers. As methods of treating other symptoms of
diseases such as AIDS progress, the incidence of wasting syndrome as the cause of death
increases. Improved prophylaxis and treatment for HIV wasting syndrome is required (Kravick,
et al., Arch. Intern. Med. 157:2069-2073, 1997). Anorexia and cachexia are well-known results
of cancer that contribute to morbidity and mortality (Simons, et al., Cancer 82:553-560, 1998;
Andrassy & Chwals, Nutrition 14:124-129, 1998). The reasons for the significant weight loss
are multiple and may be directly related to the tumor, such as increased metabolic rate, but also
include decreased intake due to poor appetite or gut involvement. Further, excessive leptin-like
signaling may contribute to the pathogenesis of wasting illness (Schwartz, et al., Proc. Nutr. Soc.
56:785-791, 1997).

The invention also provides for the NMR structure of the human AGRP C-terminus,
preferably embodied in a computer readable form. Synthesis of MARP and concentrated
solutions adequate for NMR spectroscopy are described in the examples. After the NMR
structure of MARP is determined, the structural information can be used in computational
methods to design synthetic agonist and antagonist compounds for the melanocortin receptors,
and further structure-activity relationships can be determined through routine testing using the
assays described herein and known in the art. It is expected that the structure coordinates of the
NMR structure of MARP, as provided in Appendix 1, will be particularly useful for solving the
NMR structure of other melanocortin receptor antagonists or agonists. The coordinates in
Appendix 1 represent the minimized average of the coordinates of a family of NMR structures.
The minimized average was determined by (1) calculating an average from approximately 20
NMR structures found to meet the experimental NMR distance restraints and (2) subjecting this
calculated average to a final round of energy minimization.
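Step (1) of this procedure can be sketched as a per-atom coordinate average over the family of structures. The sketch assumes that the models have already been superimposed on a common frame and list the same atoms in the same order; the function name and the data layout are hypothetical, and the energy minimization of step (2) is not shown because it depends on the force field and software used.

```python
def average_structure(models):
    """Average atom coordinates across a family of structures.

    models: list of models, each a list of (x, y, z) tuples. All models are
    assumed to be pre-superimposed and to share one atom ordering.
    """
    if not models:
        raise ValueError("at least one model is required")
    n_models = len(models)
    n_atoms = len(models[0])
    if any(len(model) != n_atoms for model in models):
        raise ValueError("all models must list the same atoms in the same order")

    averaged = []
    for i in range(n_atoms):
        x = sum(model[i][0] for model in models) / n_models
        y = sum(model[i][1] for model in models) / n_models
        z = sum(model[i][2] for model in models) / n_models
        averaged.append((x, y, z))
    return averaged
```

A raw average of this kind generally has distorted bond lengths and angles, which is why a final round of energy minimization is applied to it.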

One method that may be employed for this purpose is molecular replacement. In this
method, the unknown NMR structure may be determined using the structure coordinates of this
invention as provided in Appendix 1. The Appendix 1 coordinates for the human AGRP C-
terminus have been deposited with the Brookhaven National Laboratory Protein Data Bank, and
have been assigned Brookhaven Protein Data Bank Accession Number 1qu8. This method will
provide an accurate structural form for the unknown NMR structure more quickly and efficiently
than attempting to determine such information ab initio.

One aspect of the invention is an NMR structure of the minimized agouti related protein,
embodied in a computer readable medium. Atomic coordinate information gleaned from the NMR
structure of the invention is preferably stored and provided in the form of a machine-readable
data storage medium. This medium contains information for constructing and/or manipulating
an atomic model of a receptor binding region or portion thereof. For example, the machine
readable data for the receptor binding region may comprise structure coordinates of amino acids
corresponding to (i.e., the same as or equivalent to) residues 1-18 of the N-terminal loop
(residues 1-18 of SEQ ID NO:2), and/or residues 19-34 of the central loop (residues 19-34 of
SEQ ID NO:2) and/or residues 35-46 of the C-terminal loop (residues 35-46 of SEQ ID NO:2),
or a homologue of the molecule or molecular complex comprising the region. The machine
readable data for the receptor binding region may comprise structure coordinates of amino acids
corresponding to residues 1-18 of the N-terminal loop (residues 1-18 of SEQ ID NO:2) and
residues 19-34 of the central loop (residues 19-34 of SEQ ID NO:2). The machine readable data
may also comprise residues 24-31 of the central loop (residues 24-31 of SEQ ID NO:2) and a
portion of the N-terminal loop, for example residues 15 to 18 (residues 15-18 of SEQ ID NO:2),
residues 7 to 18 (residues 7-18 of SEQ ID NO:2), or residues 1 to 18 (residues 1-18 of SEQ ID
NO:2). The homologues comprise a receptor binding region that has a root mean square
deviation from the backbone atoms of the amino acids of not more than 2.54 Å, preferably not
more than 1.66 Å.
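The root mean square deviation criterion can be illustrated with a short sketch that compares matched backbone atoms of a candidate homologue with the reference region and tests the result against the 2.54 Å and 1.66 Å limits stated above. The sketch assumes that the two coordinate sets have already been optimally superimposed and that atoms are paired one to one; the function names and category labels are hypothetical.

```python
import math


def backbone_rmsd(reference, candidate):
    """RMSD (Angstrom) between matched backbone atoms.

    reference, candidate: equal-length lists of (x, y, z) tuples, assumed to
    be superimposed beforehand (no fitting is performed here).
    """
    if not reference or len(reference) != len(candidate):
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared = sum(
        (a - b) ** 2
        for p, q in zip(reference, candidate)
        for a, b in zip(p, q)
    )
    return math.sqrt(squared / len(reference))


def classify_homologue(rmsd, limit=2.54, preferred=1.66):
    """Label an RMSD value against the limits recited in the text."""
    if rmsd <= preferred:
        return "preferred"
    if rmsd <= limit:
        return "within limit"
    return "outside limit"
```

In practice the superposition itself (for example by a least-squares fit over the backbone atoms of residues 1-34) would precede this calculation.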

The machine-readable data storage medium can be used for interactive drug design and
molecular replacement studies. For example, a data storage material is encoded with a first set
of machine-readable data that can be combined with a second set of machine-readable data. For
molecular replacement, the first set of data can comprise a Fourier transform of at least a portion
of the structural coordinates of the AGRP C-terminus or portion thereof of interest, and the
second data set comprises an X-ray diffraction pattern of a molecule such as the melanocortin
receptor of interest. Using a machine programmed with instructions for using the first and
second data sets, a portion or all of the structure coordinates corresponding to the second data can
be determined.

Protein for determination of the NMR structure and the assays described herein can be
produced using synthetic or recombinant expression techniques. For example, the AGRP C-
terminal portion (MARP) can be readily synthesized using techniques described in Yang, et al.,
supra. In addition, high level expression of melanocortin receptors can be obtained in suitable
expression hosts such as E. coli; yeast and other eukaryotic expression systems can also be used.
The receptors can be expressed alone, as fragments of the mature or full-length sequence, or as
fusions to heterologous sequences.

Initially, purification of the melanocortin receptor can be obtained by conventional
techniques, such as hydrophobic interaction chromatography (HPLC), ion exchange
chromatography (HPLC), and heparin affinity chromatography. To achieve higher purification
of melanocortin receptors, the receptors can be ligand-shift-purified by using a column that
separates the receptor according to charge, such as an ion exchange or hydrophobic interaction
column, and then binding the eluted receptor with a ligand, especially an agonist. The ligand
induces a change in the receptor's surface charge such that when re-chromatographed on the
same column, ligand-bound receptor is separated from unliganded receptor. Usually saturating
concentrations of ligand are used in the column and the protein can be preincubated with the
ligand prior to passing it over the column. The structural studies detailed herein indicate the
general applicability of this technique for obtaining super-pure melanocortin receptors for use in
assays to assess the antagonist or agonist activity of candidate compounds.

There are many uses and advantages provided by the present invention. For example, the
methods and compositions described herein are useful for identifying peptides, peptidomimetics
or small natural or synthetic organic molecules that modulate melanocortin receptor activity.
The compounds are useful in treating melanocortin receptor-based disorders. Methods and
compositions of the invention also find use in characterizing structure-function relationships of
natural and synthetic ligand compounds.

Another aspect of the invention also pertains to a polypeptide molecule that acts as a
strong antagonist of melanocortin receptor types 3 and 4, and is useful for the treatment of eating
disorders and obesity. As discussed above, research has demonstrated that the final 46 residues
of the human AGRP sequence ("MARP") possess full receptor antagonist activity. The
structure of MARP, solved by use of nuclear magnetic resonance and described herein, suggests
that the final 13 residues of MARP are unstructured and thus may not play a role in receptor
binding or antagonism. Thus, substantial further minimization of human AGRP may be
achieved. Specifically, it is expected that the sequence (indicated by the single letter amino acid
code, with residue numbers marked after every fifth residue):

CVRLH5 ESCLG10 QQVPC15 CDPAA20 TCYCR25 FFNAF30 CYC (SEQ ID NO:3)

which is designated "MARP-33" (based upon its 33 amino acid length) will fold with proper
native-like disulfide bonds and possess the full biological activity of MARP. This sequence is
equivalent to the first 33 amino acids of MARP but contains a Cys→Ala substitution at position
19 (indicated as "A") to avoid having a non-bridged thiol.
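The rationale for the substitution can be checked with a short sketch that counts cysteines in SEQ ID NO:3 as written above and in the corresponding unsubstituted residues 1-33 of MARP. The helper name is hypothetical, and the sketch only counts residues; it does not predict which cysteines pair with one another.

```python
# SEQ ID NO:3 (MARP-33) as given in the text, single letter code.
MARP_33 = "CVRLHESCLGQQVPCCDPAATCYCRFFNAFCYC"


def substitute(sequence, position, residue):
    """Return the sequence with the residue at a 1-based position replaced."""
    return sequence[: position - 1] + residue + sequence[position:]


# Residues 1-33 of MARP before the Cys->Ala change at position 19.
parent_1_33 = substitute(MARP_33, 19, "C")

cys_in_marp_33 = MARP_33.count("C")     # cysteines available for disulfides
cys_in_parent = parent_1_33.count("C")  # cysteines without the substitution

# An even count allows every cysteine to be paired in a disulfide bond;
# an odd count necessarily leaves one free thiol.
all_paired_possible = cys_in_marp_33 % 2 == 0
rff_triplet = MARP_33[24:27]            # residues 25-27
```

Truncating MARP after residue 33 without the substitution leaves an odd number of cysteines, so one thiol would be unpaired; replacing Cys-19 with Ala restores an even count while the residue 25-27 triplet is retained.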

MARP-33 finds utility as a potential therapeutic for the treatment of eating disorders and
obesity, and may be administered to a patient in need thereof, to increase eating and fat
deposition. In addition, the MARP-33 sequence may be modified such that the modified
MARP-33 may have therapeutic utility as a competitive inhibitor of in vivo human AGRP. Such
modifications can include changing the sequence to increase binding affinity, to increase the
level of antagonism and to increase stability of the molecule.

Accordingly, one aspect of the invention pertains to a method of treating a disease state
in mammals that is alleviated by treatment with a polypeptide having an amino acid sequence:
CVRLHESCLGQQVPCCDPAATCYCRFFNAFCYC (SEQ ID NO:3)
which method comprises administering to a mammal in need of such a treatment a
therapeutically effective amount of the polypeptide, which can be administered, by way of
illustration and not limitation, in a liquid formulation or a solid formulation, such as in the
form of a pharmaceutically acceptable salt thereof. Such a disease state can be a wasting
syndrome. In addition, the invention encompasses a pharmaceutical composition comprising a
therapeutically effective amount of a polypeptide of this sequence (SEQ ID NO:3), in a liquid,
solid or other suitable formulation, including pharmaceutically acceptable carriers, stabilizers,
excipients and so forth, such as are well known in the state of the art.

In addition, the structure and sequence of MARP-33 are useful tools in rational drug
design for the design of peptide-based therapeutics. Further, the structure of this molecule will
facilitate the development of non-peptide based therapeutics.

The following examples illustrate various aspects of this invention. These examples do
not limit the scope of this invention.

EXAMPLES 
Abbreviations 

3JHNα                   three-bond αH-NH scalar coupling constant
AGRP/ART                Agouti Related Protein/Agouti Related Transcript
Conformational shift    experimental chemical shift minus random coil chemical shift
DQF-COSY                two-dimensional double-quantum filtered correlation spectroscopy
HX                      hydrogen-deuterium exchange
ICK                     Inhibitor Cystine Knot
MARP                    Minimized human Agouti Related Protein, residues 87-132 of human AGRP
MC3r/MC4r               Melanocortin Receptor 3/4
MRE                     Mean Residue Ellipticity
α-MSH                   α-Melanocyte Stimulating Hormone
NDP-MSH                 [Nle4, D-Phe7]α-MSH, a superpotent melanocortin agonist
nOe                     nuclear Overhauser enhancement
NOESY                   two-dimensional nuclear Overhauser spectroscopy
TOCSY                   two-dimensional total correlation spectroscopy

Methods and Materials

A. Chemical protein synthesis: Nα-Acetyl-MARP (SEQ ID NO:2) and Nα-acetyl-
MARP(Arg25Ala) were synthesized, folded and purified to each give a protein containing five
disulfide bonds, as reported in Yang, et al., supra.

B. CD experiments: Circular dichroism ("CD") spectra were recorded at 25°C on an
Aviv 60DS spectropolarimeter in a rectangular 1 mm path length cuvette for concentrations up to
60 µM. For concentrations higher than this a round cell with a 0.1 mm path length was used.
All CD samples were 50 mM potassium phosphate, pH 4.25. Concentration dependence was
ruled out in the range 20 µM-1 mM. Temperature dependence was determined for 5-85°C. The
spectra are superimposable from 5 to 45°C.
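Spectra recorded at different concentrations and path lengths are commonly compared after conversion to mean residue ellipticity (MRE). The sketch below uses the conventional relation MRE = θ / (10 · l · c · n), with θ the observed ellipticity in millidegrees, l the path length in cm, c the molar concentration and n the number of residues, giving units of deg cm² dmol⁻¹. The function name and the example ellipticity are hypothetical; the text does not state which normalization (residues or peptide bonds) was used, so n = 46 residues is an assumption.

```python
def mean_residue_ellipticity(theta_mdeg, conc_molar, path_cm, n_residues):
    """Convert observed ellipticity (mdeg) to MRE in deg cm^2 dmol^-1."""
    if conc_molar <= 0 or path_cm <= 0 or n_residues <= 0:
        raise ValueError("concentration, path length and residue count must be positive")
    return theta_mdeg / (10.0 * path_cm * conc_molar * n_residues)


# Hypothetical reading of -10 mdeg for a 60 uM sample in a 1 mm (0.1 cm) cell.
example_mre = mean_residue_ellipticity(-10.0, 60e-6, 0.1, 46)
```

Because the product of path length and concentration enters the denominator, a tenfold shorter cell compensates for a tenfold higher concentration, which is consistent with switching to the 0.1 mm cell above 60 µM.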

C. NMR sample preparation: The activity of MARP used for the NMR sample and
that of a single mutant were assayed by measuring the inhibition of cAMP production in the
presence of NDP-MSH (Shutter, et al., supra; Huszar, et al., supra) in HEK-293 cells transfected
with human MC4r. Control experiments were performed with no MARP. The NMR samples
were found to be native-like with complete activity. NMR samples contained approximately 1.9
mM MARP at pH 4.2 in 50 mM KH2PO4 buffer in 90% H2O/10% D2O. Additional samples for
HX experiments were prepared by lyophilization of protonated samples followed by
reconstitution in 700 µl D2O.

D. NMR experiments: 1H 2D NMR spectra were principally acquired at 15°C on a
Varian 500 Unity Plus spectrometer using inverse probes. NMR data were routinely acquired
with a 6000 Hz spectral width, 4096 complex points in t2, and 512 (TOCSY/DQF-COSY) or 700
(NOESY) increments in t1. All spectra were processed using the MNMR package (Carlsberg
Laboratory, Department of Chemistry, Denmark) and analyzed using XEASY (Bartels, et al., J.
Biomol. NMR 6:1-10 (1995)), with chemical shifts referenced to 1,4-dioxane at 3.743 ppm.
Sequential assignments of all backbone and >90% of side chain protons were accomplished
using standard methods (Wüthrich, K. "NMR of Proteins and Nucleic Acids" (John Wiley and
Sons, New York, 1986); Redfield, C. in "NMR of Macromolecules: A Practical Approach", pp.
71-99 (Roberts, G.C.K., Ed., IRL Press at Oxford University Press, Oxford, 1993)) for 50 ms
TOCSY, 150 ms NOESY and DQF-COSY data. Additional data sets were acquired at 25°C and
30°C to resolve ambiguities. Examination of the three Pro residues identified nOes consistent
only with trans-Pro. Four additional peaks were identified in the αN region of the TOCSY
spectrum; however, associated spin systems could not be identified, nor could nOes to the
peaks in question.
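The stated acquisition parameters fix the direct-dimension acquisition time and the digital resolution before any zero-filling. The sketch below assumes simple complex sampling with a dwell time of 1/(spectral width); the function names are hypothetical and the calculation ignores zero-filling and apodization applied during processing.

```python
def acquisition_time_s(complex_points, spectral_width_hz):
    """Acquisition time in seconds for complex sampling at dwell = 1/SW."""
    return complex_points / spectral_width_hz


def digital_resolution_hz(complex_points, spectral_width_hz):
    """Frequency spacing between points (Hz) before zero-filling."""
    return spectral_width_hz / complex_points


# Direct dimension (t2) as described: 4096 complex points, 6000 Hz.
t2_acquisition = acquisition_time_s(4096, 6000.0)
t2_resolution = digital_resolution_hz(4096, 6000.0)
```

With these values the t2 acquisition time is roughly 0.68 s and the raw digital resolution roughly 1.5 Hz per point, which is relevant when coupling constants are later compared to within ±0.5 Hz.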

NOESY data for distance restraints were collected at 15°C using the WET sequence
(Smallcombe, et al., J. Magn. Reson. Ser. A 117:295-303 (1995)) for water suppression, 1.6 s
recycle delay and a mixing time of 80 ms in both H2O and D2O. 3JHNα coupling constants at
25°C were determined by both linear least squares fitting of the antiphase doublets in a DQF-
COSY and also using the INFIT (Szyperski, et al., J. Magn. Reson. 99:552-560 (1992)) module
of XEASY with 150 ms NOESY data. These methods agreed to within ±0.5 Hz for all of the
measured coupling constants. At 15°C larger intrinsic linewidths precluded accurate
measurement of 3JHNα.

For amide exchange experiments, the magnet was preshimmed on a 21 residue peptide
sample at pH 4 in D2O/phosphate buffer. The first TOCSY experiment was begun 23 minutes
after reconstituting the protonated sample in D2O. Four TOCSY experiments identical to those
described above, except for the number of t1 increments, were acquired back to back at 15°C
over a period of 24 hours. The first three consisted of 150 t1 increments and the final experiment
300 increments. NOESY and DQF-COSY spectra were also acquired as described above.

E. Structure calculations: Final structure calculations included the covalent
connectivity of the published disulfide map (Bures, et al., supra), and were based on a total of
414 interproton distance constraints derived from the 80 ms 2D NOESY spectra and 34
backbone φ dihedral angle constraints derived from coupling constant measurements, giving a
total of 448 restraints, or 9.7 restraints per residue. The distance restraints can be broken
down into 228 intraresidue (backbone to side chain only), 129 sequential, 20 medium range
(1<|i-j|≤5) and 37 long range (|i-j|>5) restraints. These restraints were assigned as strong,
medium or weak. The total numbers of restraints in each category were 95 strong, 246 medium,
and 77 weak. All categories had a lower limit of 1.6 Å, with upper limits of 2.8, 3.5 and 5 Å for
the strong, medium and weak categories, respectively. Trial structures were generated using the
simulated annealing protocol from CNS version 0.4a (anneal.inp) with SUM averaging for the
nOe distances (Brünger, et al., Acta Crystallogr. Sect. D Biol. Crystallogr. 54:905-921 (1998);
Nilges, et al., FEBS Lett. 239:129-136 (1988); Nilges, et al., Protein Eng. 2:27-38 (1988);
Kuszewski, et al., J. Magn. Reson. Ser. B 112:79-81 (1996); Stein, et al., J. Magn. Reson.
124:154-164 (1997)). Twenty structures with no bond or nOe angle violations were used to
represent the solution structure of MARP (see Table 1 for Root Mean Square Deviations,
"RMSDs"). Structures were displayed using MOLMOL (Koradi, et al., J. Mol. Graphics
14:51-55 (1996)).
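The restraint bookkeeping described above can be sketched as follows. The distance bounds are those stated in the text; the range boundaries (sequence separation of 1 for sequential, 2 to 5 for medium range, more than 5 for long range) follow the ranges as read from the text and should be treated as an assumption, and all function names are hypothetical.

```python
LOWER_LIMIT = 1.6  # Angstrom, common to all categories
UPPER_LIMITS = {"strong": 2.8, "medium": 3.5, "weak": 5.0}  # Angstrom


def sequence_range(i, j):
    """Classify a restraint between residues i and j by sequence separation."""
    separation = abs(i - j)
    if separation == 0:
        return "intraresidue"
    if separation == 1:
        return "sequential"
    if separation <= 5:
        return "medium range"
    return "long range"


def violation(distance, category):
    """Return how far (Angstrom) a distance lies outside its allowed bounds."""
    upper = UPPER_LIMITS[category]
    if distance < LOWER_LIMIT:
        return LOWER_LIMIT - distance
    if distance > upper:
        return distance - upper
    return 0.0


# Totals recited in the text for the 46-residue MARP.
distance_restraints = 228 + 129 + 20 + 37
total_restraints = distance_restraints + 34  # plus dihedral angle restraints
restraints_per_residue = total_restraints / 46
```

A structure would be accepted only if `violation` returns zero (or a value below a chosen tolerance) for every restraint; the tolerance itself is not stated in the text.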

Example 1 

Chemical synthesis and characterization of MARP 
The synthesis and biochemical characterization of MARP were previously reported in
Yang, et al., supra. The N-terminal residue (Cys-1) of MARP corresponds to the first Cys (Cys-
87) of the Cys-rich region in full length, 132 residue human AGRP. The material used for the
MARP NMR sample showed native-like activity as measured by its ability to competitively
inhibit NDP-MSH at MC4r, as has been shown in previous studies (Yang, et al., supra; Ollmann,
et al., Science 278:135-138 (1997)). Previous mutational studies of agouti and AGRP showed
residues Arg-25, Phe-26, Phe-27 (the RFF triplet; Tota, et al., supra) and Asp-17 (in agouti) to
be determinants of receptor binding (Kiefer, et al., Biochemistry 36:2084-2090 (1997); Kiefer, et
al., Biochemistry 37:991-997 (1998); Tota, et al., supra).

This experiment measured inhibition of NDP-MSH stimulated cAMP generation in cells
transfected with MC4r. A control experiment was conducted with no MARP present. The
addition of MARP (5.0 x 10^-6 M) demonstrated competitive inhibition of NDP-MSH. The
addition of MARP with an Arg-25 to Ala substitution (5.0 x 10^-6 M) demonstrated loss of
inhibition due to a mutation in the active loop. This experiment thus showed that the replacement
of Arg-25 by Ala results in a complete loss of inhibitory activity.
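The behavior expected of a competitive antagonist in such an assay can be sketched with the standard two-ligand occupancy relation, in which agonist occupancy is (A/Ka) / (1 + A/Ka + I/Ki). The dissociation constants used below are hypothetical placeholders, not measured values for NDP-MSH or MARP; only the antagonist concentration of 5.0 x 10^-6 M is taken from the text.

```python
def agonist_occupancy(agonist, k_a, antagonist=0.0, k_i=1.0):
    """Fraction of receptor occupied by agonist with a competitive antagonist.

    agonist, antagonist: molar concentrations; k_a, k_i: dissociation
    constants (M) of agonist and antagonist, respectively.
    """
    a = agonist / k_a
    i = antagonist / k_i
    return a / (1.0 + a + i)


def dose_ratio(antagonist, k_i):
    """Factor by which the agonist concentration must rise to restore the
    occupancy seen without antagonist."""
    return 1.0 + antagonist / k_i


# Hypothetical constants: agonist Ka = 1 nM, antagonist Ki = 0.5 uM.
without_marp = agonist_occupancy(1e-9, 1e-9)
with_marp = agonist_occupancy(1e-9, 1e-9, antagonist=5.0e-6, k_i=5.0e-7)
shift = dose_ratio(5.0e-6, 5.0e-7)
```

Under this model the agonist dose-response curve shifts to higher concentrations without a change in maximal response; an inactive mutant such as Arg25Ala would behave as if Ki were very large, leaving the curve essentially unshifted.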

The far-UV CD spectrum of MARP was measured in 50 mM phosphate buffer at pH
4.25 as a function of temperature with spectra shown every 5°C from 5°C to 30°C and also at
45°C. The spectrum was similar to that reported for a similar C-terminal fragment (residues 85-
132) of AGRP (residues 85-132 of SEQ ID NO:1) by Rosenfeld, et al., supra, and is
characterized by a negative maximum at approximately 198 nm and a slight negative maximum
at 245 nm. There was little indication of canonical α-helix; however, the spectrum did suggest
some β character or turns. The near-UV CD spectrum, indicative of tertiary structure, showed a
weak minimum at approximately 275 nm which could be attributed to the disulfide bonds and
possibly restricted orientations of the side chains of Tyr-23 and Tyr-32.

At 25°C between 20 µM and 1.0 mM there was no observable concentration dependence
as measured by CD. Between 1 mM and 1.9 mM there were no concentration dependent
changes in the NMR spectra (i.e. linewidths, chemical shifts, etc. all remain constant). The far-
UV CD spectrum remained constant between 5°C and 45°C, a temperature range well beyond
that of the present NMR experiments. Thus, by all indications, MARP existed as a monomer
and did not exhibit temperature dependence under the conditions of the NMR experiments. The
characteristics of the NMR spectra were indicative of a well folded protein with a single
predominant conformer. 3JHNα coupling constants and temperature coefficients also indicated a
fully folded, non-random coil conformer as shown in Table 1 below, which shows the MARP
coupling constants at 25°C and NH temperature coefficients in 50 mM phosphate buffer at pH
4.2.

Table 1





3 J N na coupling 


NH temperature 


Residue 


constant (Hz) 


coeincient (ppD/ L) 


Cys-1 




6.93 


Val-2 


8.79 


3.63 


Arg-3 


7.00 


6.37 


Leu-4 


5.64 


3.47 


His-5 


7.50 


2.58 


Glu-6 


9.34 


2.90 - 


Ser-7 




5.86 


Cys-8 


9.92 


3.84 


Leu-9 


4.40 


2.43 


Gly-10 




7.06 


Gin- 11 


8.14 


4.94 


Gin- 12 


8.00 


4.38 


Val-13 


9.03 


5.56 


Pro- 14 




0.00 


Cys-1 5 


4.33 


6.74 


Cys-1 6 


5.42 


3.10 


Asp- 17 




4.81 


Pro- 18 




0.00 


Cys-1 9 


8.81 


3.84 


Ala-20 


9.64 


3.23 


Thr-21 


9.36 


4.04 


Cys-22 


5.76 


7.56 


Tyr-23 




2.01 


Cys-24 




3.14 


Arg-25 


5.90 


2.93 


Phe-26 


8.70 


3.67 


Phe-27 


3.68 


6.21 


Asn-28 


7 82 


5.66 


Ala-29 


7.56 


2.10 


Phe-30 


7 59 


4.84 


Cys-31 


9.17 


5.84 


Tyr-32 


9,49 


4.43 


Cys-33 




1.13 


Arg-34 


9.34 


2.74 


Lys-35 


7.25 


8.80 


Leu-36 


7.74 


6.43 


Gly-37 




7.00 


Thr-38 


8.07 


2.07 


Ala-39 


4.48 


7.67 


Met-40 


7.50 


2.98 


Asn-41 


8.19 


1.08 


Pro-42 




0.00 


Cys-43 


7.24 


7.86 


Ser-44 


7.26 


8.38 
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Vnhq coupling NH temperature 

Residue constant (Hz) coefficient (ppb/°C) 

Arg-45 7.54 4.47 

Thr-46 4.61 



Example 2 
Structural description of MARP
The minimized average NMR structure of MARP is shown in Fig. 1. Consistent with the
far-UV CD spectrum, MARP showed little evidence of helix or sheet secondary structure. The
disulfide bonds (1-16, 8-22, 15-33, 19-43 and 24-31) appeared to form a scaffold upon which the
structure was apportioned into three major loops, which are referred to as the N-terminal loop
(residues 1-18), the central loop (residues 19-34) and the C-terminal loop (residues 35-46), and
are indicated in Fig. 1. RMSDs for the individual loops are reported in Table 2 below:
Table 2

Summary of MARP backbone and heavy atom RMSDs

Region (residues)             Backbone RMSD* (Å)   Heavy atom RMSD* (Å)
Global (1-46)                 2.54                 3.26
N- and active loops (1-34)    1.66                 2.38
N-terminal loop (1-18)        1.31                 2.03
Central loop (19-34)          1.51                 2.22
Active loop (24-31)           0.69                 1.53
C-terminal loop (35-46)       2.36                 3.43

* Determined by fitting the family of 20 NMR structures to the minimized average structure
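
By way of illustration only, the RMSD figures in Table 2 rest on the usual root mean square deviation between corresponding atoms. The following is a minimal sketch of that calculation, assuming the structures have already been superimposed on the reference; the fitting step itself is omitted. The function names and the three-atom test coordinates are hypothetical, and averaging over the family is an assumption, since the text does not state how the per-structure values were combined.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation (in Å) between two equal-length lists of (x, y, z)."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


def mean_rmsd(family, reference):
    """Average the RMSD of each family member against the reference structure."""
    return sum(rmsd(member, reference) for member in family) / len(family)


if __name__ == "__main__":
    # Hypothetical three-atom fragments, not MARP coordinates.
    reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
    family = [
        [(0.0, 0.0, 1.0), (1.5, 0.0, 1.0), (3.0, 0.0, 1.0)],
        [(0.0, 2.0, 0.0), (1.5, 2.0, 0.0), (3.0, 2.0, 0.0)],
    ]
    print(round(mean_rmsd(family, reference), 2))
```

Restricting the coordinate lists to backbone atoms or to all heavy atoms of a given residue range would give the two kinds of value tabulated above.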

The N-terminal and central loops were much better defined, both within the loops and
with respect to each other, than the C-terminal loop. The backbone RMSD for the entire protein
(2.54 Å) was of the same order as that of the C-terminal loop (2.36 Å), while the backbone
RMSD for residues 1-34 (1.66 Å) was of the order of the individual N-terminal and central loops.
To demonstrate limited backbone structure variability of the N-terminal and central loops, a
superposition of 14 structures (selected for clarity) for residues 1-34 and the MARP minimized
average structure (residues 1-46) is shown in Fig. 2.

Four of the five disulfide bonds are located at the base of the structure where they appear
to pinch together the bottoms of the loops to form the "core" of the protein (Fig. 1). The
exception is disulfide bond 24-31, which stabilizes the central loop. The central loop, residues
19-34, contains the RFF triplet determined to be critical for activity. This motif is situated
within an even smaller, well defined loop bound by Cys-24 and Cys-31 which is referred to as
the "active" loop. The side chain atoms of the RFF triplet residues are located at the surface of
the protein as depicted in Fig. 3. Recent experiments further highlight the importance of this
active loop. These studies demonstrate that short cyclic peptides corresponding to residues 24-31
of human AGRP do in fact antagonize MC3r and MC4r (Tota, et al., supra).
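
By way of illustration only, the disulfide pairing and loop boundaries described above can be checked against the MARP sequence with a short script. The following is a minimal sketch and not part of the reported method; the one-letter sequence is transcribed from SEQ ID NO:2, and the function names are hypothetical.

```python
# One-letter form of the 46-residue MARP sequence (SEQ ID NO:2).
MARP = "CVRLHESCLGQQVPCCDPCATCYCRFFNAFCYCRKLGTAMNPCSRT"

# Disulfide pairs and loop boundaries as stated in Example 2 (1-based numbering).
DISULFIDES = [(1, 16), (8, 22), (15, 33), (19, 43), (24, 31)]
LOOPS = {"N-terminal": (1, 18), "central": (19, 34), "C-terminal": (35, 46)}


def loop_of(position):
    """Return the name of the loop containing a 1-based residue position."""
    for name, (start, end) in LOOPS.items():
        if start <= position <= end:
            return name
    raise ValueError(f"position {position} is outside residues 1-46")


def describe_disulfides():
    """Map each disulfide pair to the pair of loops that its cysteines sit in."""
    summary = {}
    for i, j in DISULFIDES:
        # Both partners must be cysteines for the pairing to be consistent.
        if MARP[i - 1] != "C" or MARP[j - 1] != "C":
            raise ValueError(f"{i}-{j} does not join two cysteines")
        summary[(i, j)] = (loop_of(i), loop_of(j))
    return summary


if __name__ == "__main__":
    for pair, loops in describe_disulfides().items():
        print(pair, loops)
    print("Residues 25-27:", MARP[24:27])
```

The check confirms that the five pairs account for all ten cysteines and that residues 25-27, which lie between Cys-24 and Cys-31, are the RFF triplet.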

Inspection of the family of NMR structures and consideration of the observed HX
revealed a structure for the central loop that is best described as an irregular hairpin with a well
defined loop from Cys-24 to Cys-31 (RMSD 0.6 Å, Fig. 3) and a stem region which is both
twisted around and curved along its z-axis (Fig. 1). This characterization was supported by
critical examination of the nOe, ³JHNα and chemical shift data. As shown in Fig. 3, the active
loop is highly constrained with the RFF triplet side chains exposed to solvent. Arg-25 and Phe-27
point out into the solvent, while one face of the Phe-26 aromatic ring rests parallel against the
surface of the protein. Though the active loop satisfies several of the determinants for an Ω-loop
(Leszczynski, et al., Science 234:849-855 (1986)), the side chain orientation of Arg-25 and Phe-27
precludes its definition as such since Ω-loop side chains generally pack within the loop of
backbone atoms.

HX experiments demonstrated that the amide protons of residues Cys-8, Ala-20, Thr-21,
Tyr-23, Tyr-32, Cys-33 and Arg-34 are protected from exchange with solvent. To explore whether
these results were consistent with the average structure, the program DSSP (Kabsch, et al.,
Biopolymers 22:2577-2637 (1983)) was used to identify potential hydrogen bonds. DSSP
identified the backbone amides of Ala-20, Thr-21, Tyr-23, and Arg-34 as potential hydrogen
bond donors. In addition, solvent accessible surface area calculations showed that residues Cys-8
and Cys-33 were completely buried from solvent, though in the D2O spectrum the αN
crosspeaks of these two residues overlap, thus their individual protection from exchange is
uncertain. Tyr-32 had only 8% solvent accessible surface area at the Cβ protons.

The NMR structure gave a well resolved fold; however, as mentioned previously,
canonical helices and β-sheets were not identified on the basis of nOes or other protocols
including the chemical shift index (Wishart, et al., Biochemistry 31:1647-1651 (1992)) or ³JHNα
coupling constants (Wüthrich, K., supra).

The guidelines for these protocols assigned secondary structure on the basis of four or
more consecutive residues with similar conformational shifts or ³JHNα. Helical structure was
characterized by ³JHNα < 6 Hz and negative conformational shifts and β-sheet by ³JHNα > 8 Hz and
positive conformational shifts. Even in the active loop and stem region of the central loop, no
regular secondary structure was identified by these criteria. However, the chemical shift index
pointed towards a possible extended strand from residue 31 to residue 35. Table 3 sets forth the
¹H chemical shifts at 15°C, ³JHNα at 25°C and the NH temperature coefficients.
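
By way of illustration only, the coupling-constant criterion described above can be applied to the Table 1 values with a short script. The following is a minimal sketch under stated assumptions: the 6 Hz and 8 Hz cut-offs are the conventional helix and sheet limits and are assumed here, the function name is hypothetical, and residues for which Table 1 reports no coupling constant are simply treated as breaking a run. Conformational shifts are not considered.

```python
HELIX_MAX_HZ = 6.0   # assumed upper limit for helical 3J(HN-alpha)
SHEET_MIN_HZ = 8.0   # assumed lower limit for beta-sheet 3J(HN-alpha)
MIN_RUN = 4          # "four or more consecutive residues"

# 3J(HN-alpha) coupling constants (Hz) from Table 1, keyed by residue number.
# Residues with no reported value are omitted.
J_HN_ALPHA = {
    2: 8.79, 3: 7.00, 4: 5.64, 5: 7.50, 6: 9.34, 8: 9.92, 9: 4.40,
    11: 8.14, 12: 8.00, 13: 9.03, 15: 4.33, 16: 5.42, 19: 8.81,
    20: 9.64, 21: 9.36, 22: 5.76, 25: 5.90, 26: 8.70, 27: 3.68,
    28: 7.82, 29: 7.56, 30: 7.59, 31: 9.17, 32: 9.49, 34: 9.34,
    35: 7.25, 36: 7.74, 38: 8.07, 39: 4.48, 40: 7.50, 41: 8.19,
    43: 7.24, 44: 7.26, 45: 7.54,
}


def find_runs(couplings, predicate, min_run=MIN_RUN):
    """Return (first, last) residue numbers of runs where predicate holds."""
    runs = []
    current = []
    # Iterate one past the last residue so that a trailing run is flushed.
    for residue in range(min(couplings), max(couplings) + 2):
        value = couplings.get(residue)
        if value is not None and predicate(value):
            current.append(residue)
        else:
            if len(current) >= min_run:
                runs.append((current[0], current[-1]))
            current = []
    return runs


if __name__ == "__main__":
    print("helix-like runs:", find_runs(J_HN_ALPHA, lambda j: j < HELIX_MAX_HZ))
    print("sheet-like runs:", find_runs(J_HN_ALPHA, lambda j: j > SHEET_MIN_HZ))
```

With the Table 1 values the longest qualifying stretch is three residues (19-21), so neither list contains a run, in keeping with the statement that no regular secondary structure was identified by these criteria.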

Table 3

MARP ¹H chemical shifts at 15°C in 50 mM phosphate buffer at pH 4.2

Residue  NH (ppm)      Hα (ppm)      Hβ (ppm)        Other (ppm)
Cys-1    [illegible]   [illegible]   [illegible]
Val-2    [illegible]   4.18          [illegible]     CHγ3 0.91, 0.80
Arg-3    [illegible]   4.?1          1.67, 1.9?      Hγ 1.69, 1.82; Hδ 3.24, 3.26; NH 7.31
Leu-4    [illegible]   2.11          1.?4, 1.34      Hγ [illegible]; CHδ3 0.79
His-5    8.99          4.25          3.70, 1.42      Hδ 7.22; Hε 8.51
Glu-6    8.04          4.66          2.16            Hγ 2.24
Ser-7    8.63          4.68          3.84
Cys-8    8.08          4.96          3.57, 3.13
Leu-9    [illegible]   4.08          1.57, 1.44      Hγ 1.44; CHδ3 0.82
Gly-10   8.78          4.07, 3.69
Gln-11   [illegible]   4.24          1.95            Hγ 2.28; Hε2 6.86, 7.49
Gln-12   [illegible]   4.2?          2.20, 1.95      Hγ 2.30, 2.34; Hε2 6.82, 7.45
Val-13   7.49          [illegible]   [illegible]     CHγ3 0.79, 0.90
Pro-14   [illegible]   4.3?          2.2?, 1.99      Hγ 1.84, 1.99; Hδ 3.65, 3.77
Cys-15                 4.92          3.35, 1.74
Cys-16   [illegible]   [illegible]   2.62, 3.17
Asp-17   [illegible]   [illegible]   [illegible]
Pro-18   [illegible]   [illegible]                   Hγ 1.98; Hδ 3.89, 4.04
Cys-19                 4.72          2.94, 3.47
Ala-20   8.01          4.94          1.26
Thr-21   8.76          4.58          4.00            CHγ3 1.16
Cys-22   8.94          4.58          2.94, 3.02
Tyr-23   8.71          4.62          2.79            Hδ 6.93; Hε 6.74
Cys-24   8.30          4.91          3.23, 2.60
Arg-25   8.2?          3.84          1.83, 1.55      Hγ 1.56, 1.23; Hδ 3.06
Phe-26   7.91          4.74          2.80, 3.32      Hδ 7.26; Hε 7.42
Phe-27   8.54          4.19          3.12, 3.02      Hδ 7.17; Hε 7.33
Asn-28   8.48          4.20          2.39, 7.77      Hδ2 6.66, 7.30
Ala-29   7.75          4.38          1.25
Phe-30   8.38          4.21          3.32, 3.36
Cys-31   8.34          5.63          2.59, 3.03
Tyr-32   8.88          5.20          2.59, 2.81      Hδ 6.92; Hε 6.66
Cys-33   8.19          4.96          3.21, 2.66
Arg-34   9.43          4.75          1.83, 1.70      Hγ 1.57, 1.69; Hδ 2.62, 2.88; NH [illegible]
Lys-35                 4.4?          [illegible]     Hγ 1.25, 1.43; Hδ 1.64; Hε 2.6?
Leu-36   [illegible]   [illegible]   [illegible]     Hγ 1.52; CHδ3 0.52, 0.70
Gly-37   8.45          [illegible]
Thr-38   [illegible]   4.4?          4.4?            CHγ3 1.21
Ala-39   [illegible]   [illegible]   [illegible]
Met-40   [illegible]   4.4?          [illegible]     Hγ 2.51, 2.62
Asn-41   7.74                        2.62, 2.8?      Hδ2 7.67, 6.95
Pro-42   8.68          4.47          2.29            Hγ 1.94; Hδ 3.61, 3.67
Cys-43                 4.59          3.22, 3.13
Ser-44   8.37          4.46          3.86
Arg-45   8.26          4.43          1.93            Hγ 1.66; Hδ 3.21; NH 7.25
Thr-46   7.86          4.15          4.23            CHγ3 1.15



All publications and patent applications mentioned in this specification are herein
incorporated by reference to the same extent as if each individual publication or patent
application was specifically and individually indicated to be incorporated by reference.

The invention now being fully described, it will be apparent to one of ordinary skill in
the art that many changes and modifications can be made thereto without departing from the
spirit or scope of the appended claims.

Appendix 1
Atomic Coordinates for MARP

ATOM 1 CA CYS 1 -6.900 2.269 -8.250 1.00 4.62
ATOM 2 HA CYS 1 -5.897 2.041 -7.920 1.00 4.59
ATOM 3 HB1 CYS 1 -7.160 4.143 -9.269 1.00 5.13
ATOM 4 HB2 CYS 1 -6.230 4.254 -7.780 1.00 4.63
ATOM 5 C CYS 1 -7.900 1.623 -7.293 1.00 3.97
ATOM 6 O CYS 1 -9.067 1.425 -7.636 1.00 4.16
ATOM 7 CB CYS 1 -7.088 3.796 -8.249 1.00 4.78
ATOM 8 SG CYS 1 -8.576 4.376 -7.358 1.00 5.12
ATOM 9 N CYS 1 -7.069 1.766 -9.642 1.00 5.38
ATOM 10 HT1 CYS 1 -8.026 2.021 -9.958 1.00 5.84
ATOM 11 HT2 CYS 1 -6.942 0.734 -9.624 1.00 5.52
ATOM 12 HT3 CYS 1 -6.346 2.222 -10.234 1.00 5.55
ATOM 13 N VAL 2 -7.445 1.356 -6.070 1.00 3.32
ATOM 14 HN VAL 2 -6.517 1.593 -5.848 1.00 3.27
ATOM 15 CA VAL 2 -8.315 0.801 -5.028 1.00 2.84
ATOM 16 HA VAL 2 -9.187 0.372 -5.505 1.00 3.11
ATOM 17 CB VAL 2 -7.624 -0.292 -4.151 1.00 2.58
ATOM 18 HB VAL 2 -6.927 0.197 -3.490 1.00 2.54
ATOM 19 CG1 VAL 2 -8.637 -1.028 -3.287 1.00 2.26
ATOM 20 HG11 VAL 2 -9.533 -1.215 -3.860 1.00 2.67
ATOM 21 HG12 VAL 2 -8.879 -0.427 -2.423 1.00 2.47
ATOM 22 HG13 VAL 2 -8.214 -1.969 -2.962 1.00 2.24
ATOM 23 CG2 VAL 2 -6.851 -1.295 -5.000 1.00 3.27
ATOM 24 HG21 VAL 2 -6.240 -1.922 -4.350 1.00 3.49
ATOM 25 HG22 VAL 2 -6.214 -0.766 -5.692 1.00 3.65
ATOM 26 HG23 VAL 2 -7.544 -1.914 -5.548 1.00 3.65
ATOM 27 C VAL 2 -8.750 1.9?7 -4.145 1.00 2.69
ATOM 28 O VAL 2 -8.214 3.071 -4.285 1.00 3.12
ATOM 29 N ARG 3 -9.694 1.759 -3.238 1.00 2.64
ATOM 30 HN ARG 3 -10.100 0.873 -3.138 1.00 2.81
ATOM 31 CA ARG 3 -10.120 2.851 -2.380 1.00 2.90
ATOM 32 HA ARG 3 -10.214 3.738 -2.998 1.00 3.25
ATOM 33 CB ARG 3 -11.466 2.539 -1.720 1.00 3.25
ATOM 34 HB1 ARG 3 -11.575 3.157 -0.841 1.00 3.44
ATOM 35 HB2 ARG 3 -11.477 1.501 -1.422 1.00 3.38
ATOM 36 CG ARG 3 -12.662 2.785 -2.628 1.00 3.73
ATOM 37 HG1 ARG 3 -13.287 1.905 -2.628 1.00 4.00
ATOM 38 HG2 ARG 3 -12.307 2.976 -3.630 1.00 3.91
ATOM 39 CD ARG 3 -13.486 3.978 -2.164 1.00 4.23
ATOM 40 HD1 ARG 3 -13.348 4.788 -2.865 1.00 4.60
ATOM 41 HD2 ARG 3 -13.136 4.285 -1.189 1.00 4.30
ATOM 42 NE ARG 3 -14.913 3.662 -2.078 1.00 4.74
ATOM 43 HE ARG 3 -15.185 2.745 -2.294 1.00 4.90
ATOM 44 CZ ARG 3 -15.856 4.540 -1.726 1.00 5.32
ATOM 45 NH1 ARG 3 -15.532 5.796 -1.425 1.00 5.50
ATOM 46 HH11 ARG 3 -14.578 6.091 -1.459 1.00 5.28
ATOM 47 HH12 ARG 3 -16.246 6.446 -1.162 1.00 6.06
ATOM 48 NH2 ARG 3 -17.129 4.160 -1.677 1.00 6.01
ATOM 49 HH21 ARG 3 -17.379 3.219 -1.903 1.00 6.19
ATOM 50 HH22 ARG 3 -17.837 4.816 -1.413 1.00 6.50
ATOM 51 C ARG 3 -9.053 3.103 -1.329 1.00 2.75
ATOM 52 O ARG 3 -8.519 2.163 -0.738 1.00 2.50
ATOM 53 N LEU 4 -8.725 4.372 -1.111 1.00 3.24
ATOM 54 HN LEU 4 -9.167 5.082 -1.615 1.00 3.68
ATOM 55 CA LEU 4 -7.708 4.726 -0.146 1.00 3.40
ATOM 56 HA LEU 4 -6.847 4.143 -0.371 1.00 3.11
ATOM 57 CB LEU 4 -7.351 6.196 -0.264 1.00 4.25
ATOM 58 HB1 LEU 4 -7.379 6.479 1.304 1.00 4.60
ATOM 59 HB2 LEU 4 -6.353 6.339 0.110 1.00 4.61
ATOM 60 CG LEU 4 -8.285 7.097 0.500 1.00 4.65
ATOM 61 HG LEU 4 -8.344 6.741 1.509 1.00 4.56
ATOM 62 CD1 LEU 4 -7.773 8.520 0.517 1.00 5.17
ATOM 63 HD11 LEU 4 -7.232 8.714 -0.396 1.00 5.39
ATOM 64 HD12 LEU 4 -7.113 8.647 1.363 1.00 5.60
ATOM 65 HD13 LEU 4 -8.605 9.203 0.599 1.00 5.27
ATOM 66 CD2 LEU 4 -9.659 6.991 -0.103 1.00 5.22
ATOM 67 HD21 LEU 4 -10.392 7.382 0.584 1.00

ATOM 586 CB MET 40 11.888 -2.677 -2.505 1.00 4.33
ATOM 587 HB1 MET 40 12.568 -1.837 -2.491 1.00 4.41
ATOM 588 HB2 MET 40 11.222 -2.575 -3.350 1.00 4.46
ATOM 589 CG MET 40 12.704 -3.945 -2.692 1.00 5.16
ATOM 590 HG1 MET 40 12.841 -4.118 -3.751 1.00 5.45
ATOM 591 HG2 MET 40 12.162 -4.774 -2.262 1.00 5.31
ATOM 592 SD MET 40 14.325 -3.847 -1.907 1.00 6.00
ATOM 593 CE MET 40 15.309 -3.173 -3.244 1.00 6.60
ATOM 594 HE1 MET 40 15.124 -3.736 -4.146 1.00 6.92
ATOM 595 HE2 MET 40 16.356 -3.239 -2.987 1.00 6.56
ATOM 596 HE3 MET 40 15.041 -2.139 -3.404 1.00 7.01
ATOM 597 C MET 40 11.386 -1.423 -0.381 1.00 3.37
ATOM 598 O MET 40 11.559 -1.513 0.835 1.00 3.68
ATOM 599 N ASN 41 11.510 -0.272 -1.048 1.00 3.35
ATOM 600 HN ASN 41 11.386 -0.269 -2.021 1.00 3.59
ATOM 601 CA ASN 41 11.864 0.977 -0.377 1.00 3.57
ATOM 602 HA ASN 41 12.586 0.745 0.390 1.00 4.02
ATOM 603 CB ASN 41 12.503 1.948 -1.377 1.00 4.36
ATOM 604 HB1 ASN 41 12.506 2.941 -0.952 1.00 4.61
ATOM 605 HB2 ASN 41 11.920 1.953 -2.286 1.00 4.37
ATOM 606 CG ASN 41 13.932 1.569 -1.727 1.00 5.30
ATOM 607 OD1 ASN 41 14.191 0.473 -2.226 1.00 5.84
ATOM 608 ND2 ASN 41 14.868 2.477 -1.470 1.00 5.81
ATOM 609 HD21 ASN 41 14.592 3.331 -1.074 1.00 5.65
ATOM 610 HD22 ASN 41 15.798 2.256 -1.685 1.00 6.50
ATOM 611 C ASN 41 10.651 1.636 0.277 1.00 3.02
ATOM 612 O ASN 41 9.507 1.390 -0.108 1.00 3.02
ATOM 613 N PRO 42 10.898 2.490 1.291 1.00 3.12
ATOM 614 CA PRO 42 9.833 3.192 2.018 1.00 3.15
ATOM 615 HA PRO 42 9.154 2.496 2.484 1.00 3.54
ATOM 616 CB PRO 42 10.588 3.974 3.102 1.00 4.06
ATOM 617 HB1 PRO 42 10.553 3.428 4.033 1.00 4.67
ATOM 618 HB2 PRO 42 10.133 4.945 3.230 1.00 4.13
ATOM 619 CG PRO 42 11.983 4.089 2.593 1.00 4.46
ATOM 620 HG1 PRO 42 12.673 4.160 3.421 1.00 5.17
ATOM 621 HG2 PRO 42 12.072 4.956 1.954 1.00 4.81
ATOM 622 CD PRO 42 12.235 2.834 1.810 1.00 3.83
ATOM 623 HD1 PRO 42 12.611 2.055 2.456 1.00 4.33
ATOM 624 HD2 PRO 42 12.927 3.025 1.002 1.00 3.84
ATOM 625 C PRO 42 9.043 4.145 1.125 1.00 2.55
ATOM 626 O PRO 42 9.610 5.065 0.531 1.00 2.68
ATOM 627 N CYS 43 7.731 3.923 1.041 1.00 2.61
ATOM 628 HN CYS 43 7.340 3.178 1.541 1.00 3.08
ATOM 629 CA CYS 43 6.862 4.768 0.230 1.00 2.66
ATOM 630 HA CYS 43 7.382 5.025 -0.623 1.00 3.10
ATOM 631 HB1 CYS 43 4.942 3.935 0.709 1.00 2.52
ATOM 632 HB2 CYS 43 5.840 3.031 -0.503 1.00 2.27
ATOM 633 C CYS 43 6.535 6.042 0.962 1.00 3.31
ATOM 634 O CYS 43 6.796 7.152 0.494 1.00 3.75

0.560 1.00 8.91
ATOM 661 NE ARG 45 8.384 10.664 1.981 1.00 9.74
ATOM 662 HE ARG 45 8.404 10.675 2.960 1.00 9.93
ATOM 663 CZ ARG 45 9.375 10.047 1.328 1.00 10.31
ATOM 664 NH1 ARG 45 9.388 10.002 -0.002 1.00 10.31
ATOM 665 HH11 ARG 45 8.653 10.435 -0.525 1.00 9.86
ATOM 666 HH12 ARG 45 10.134 9.541 -0.479 1.00 10.88
ATOM 667 NH2 ARG 45 10.357 9.466 2.011 1.00 11.06
ATOM 668 HH21 ARG 45 10.355 9.490 3.011 1.00 11.21
ATOM 669 HH22 ARG 45 11.100 9.004 1.525 1.00 11.58
ATOM 670 C ARG 45 2.858 10.387 2.457 1.00 7.69
ATOM 671 O ARG 45 2.168 10.651 1.468 1.00 8.34
ATOM 672 N THR 46 2.336 10.036 3.633 1.00 7.60
ATOM 673 HN THR 46 2.935 9.842 4.379 1.00 7.26
ATOM 674 CA THR 46 0.895 9.934 3.836 1.00 8.32
ATOM 675 HA THR 46 0.488 9.396 2.997 1.00 8.41
ATOM 676 CB THR 46 0.582 9.157 5.122 1.00 8.61
ATOM 677 HB THR 46 -0.480 9.219 5.314 1.00 8.70
ATOM 678 OG1 THR 46 1.266 9.717 6.233 1.00 8.93
ATOM 679 HG1 THR 46 2.175 9.410 6.243 1.00 9.05
ATOM 680 CG2 THR 46 0.951 7.688 5.037 1.00 8.89
ATOM 681 HG21 THR 46 0.433 7.232 4.206 1.00 9.07
ATOM 682 HG22 THR 46 0.668 7.191 5.954 1.00 9.06
ATOM 683 HG23 THR 46 2.017 7.593 4.891 1.00 9.01
ATOM 684 C THR 46 0.249 11.319 3.889 1.00 9.05
ATOM 685 OT1 THR 46 0.759 12.187 4.632 1.00 9.34
ATOM 686 OT2 THR 46 -0.761 11.52? 3.184 1.00 9.53
END
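
By way of illustration only, the whitespace-separated ATOM records of Appendix 1 can be read and used for simple geometric checks. The following is a minimal sketch, assuming each complete record carries ten fields in the order shown above (record type, serial number, atom name, residue name, residue number, x, y, z, occupancy and a final per-atom value); the class and function names are hypothetical. The two sample records are atoms 597 and 598 of the appendix.

```python
import math
from typing import NamedTuple


class Atom(NamedTuple):
    serial: int
    name: str
    residue: str
    residue_number: int
    x: float
    y: float
    z: float


def parse_atom(line):
    """Parse one whitespace-separated ATOM record into an Atom."""
    fields = line.split()
    if len(fields) != 10 or fields[0] != "ATOM":
        raise ValueError(f"not a complete ATOM record: {line!r}")
    return Atom(
        serial=int(fields[1]),
        name=fields[2],
        residue=fields[3],
        residue_number=int(fields[4]),
        x=float(fields[5]),
        y=float(fields[6]),
        z=float(fields[7]),
    )


def distance(a, b):
    """Euclidean distance (Å) between two atoms."""
    return math.dist((a.x, a.y, a.z), (b.x, b.y, b.z))


RECORDS = """\
ATOM 597 C MET 40 11.386 -1.423 -0.381 1.00 3.37
ATOM 598 O MET 40 11.559 -1.513 0.835 1.00 3.68
"""

if __name__ == "__main__":
    carbon, oxygen = (parse_atom(line) for line in RECORDS.splitlines())
    print(f"{carbon.residue}-{carbon.residue_number} C=O distance: "
          f"{distance(carbon, oxygen):.2f} Å")
```

For these two records the carbonyl C to O separation comes out at about 1.23 Å. Records that are truncated or contain an unreadable digit are rejected by the parser rather than guessed at.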
CLAIMS

What is claimed is:

1. A method of identifying a compound that modulates ligand binding to a melanocortin
receptor, said method comprising: modeling test compounds that fit spatially into a melanocortin
receptor ligand binding site of interest using an atomic structural model of a melanocortin
receptor binding region or portion thereof; screening said test compounds in an assay
characterized by binding of a test compound to a melanocortin receptor ligand binding site; and
identifying a test compound that modulates ligand binding to said melanocortin receptor.

2. The method of Claim 1 wherein said melanocortin receptor binding region comprises the
minimized agouti related protein receptor binding region or portion thereof.

3. The method of Claim 1 wherein said atomic structural model comprises atomic
coordinates of amino acid residues corresponding to residues 1-18 of the N-terminal loop of the
minimized agouti related protein (residues 1-18 of SEQ ID NO:2), residues 19-34 of the central
loop of the minimized agouti related protein (residues 19-34 of SEQ ID NO:2), and residues 35-46
of the C-terminal loop of the minimized agouti related protein (residues 35-46 of SEQ ID
NO:2).

4. The method of Claim 1 wherein said atomic structural model comprises atomic
coordinates of amino acid residues corresponding to residues 19-34 of the central loop of the
minimized agouti related protein (residues 19-34 of SEQ ID NO:2) and at least residues 15-18 of
the N-terminal loop of the minimized agouti related protein (residues 15-18 of SEQ ID NO:2).

5. The method of Claim 1 wherein said atomic structural model comprises atomic
coordinates of amino acid residues corresponding to residues 19-34 of the central loop of the
minimized agouti related protein (residues 19-34 of SEQ ID NO:2) and at least 20% of the
contiguous or non-contiguous residues or their atoms are selected from residues 1-18 of the
N-terminal loop of the minimized agouti related protein (residues 1-18 of SEQ ID NO:2).

6. The method of Claim 1 wherein said atomic structural model comprises atomic
coordinates of amino acid residues corresponding to residues 24-31 of the active loop of the
minimized agouti related protein (residues 24-31 of SEQ ID NO:2).

7. The method of Claim 1 wherein said atomic structural model comprises atomic
coordinates of amino acid residues corresponding to residues 25-27 of the active loop of the
minimized agouti related protein (residues 25-27 of SEQ ID NO:2).

8. The method of Claim 1 wherein said screening is in vitro.

9. The method of Claim 8 wherein said screening is high throughput screening.

10. The method of Claim 1 wherein said assay is a biological assay.

11. The method of Claim 1 wherein said test compound is from a library of compounds.

12. The method of Claim 1 wherein said test compound is an agonist or antagonist of ligand
binding.

13. The method of Claim 12 wherein said test compound is a small organic molecule, a
peptide, or peptidomimetic.

14. A method for identifying an agonist or antagonist of ligand binding to a melanocortin
receptor, said method comprising the steps of: providing the atomic coordinates of a
melanocortin receptor binding region or portion thereof to a computerized modeling system;
modeling compounds which match or mimic the receptor binding region and thus fit spatially
into the melanocortin receptor ligand binding site; and identifying in an assay for melanocortin
receptor activity a compound that increases or decreases the activity of said melanocortin
receptor by binding the ligand binding site of said melanocortin receptor, whereby an agonist or
antagonist of ligand binding is identified.

15. The method of Claim 14 wherein said melanocortin receptor binding region comprises
the minimized agouti related protein receptor binding region or portion thereof.

16. A machine-readable data storage medium, comprising a data storage material encoded
with machine readable data which, when using a machine programmed with instructions for
using said data, is capable of displaying a graphical three-dimensional representation of a
molecule that binds a melanocortin receptor comprising structure coordinates of amino acid
residues corresponding to residues 1-18 of the N-terminal loop of the minimized agouti related
protein (residues 1-18 of SEQ ID NO:2), residues 19-34 of the central loop of the minimized
agouti related protein (residues 19-34 of SEQ ID NO:2), and residues 35-46 of the C-terminal
loop of the minimized agouti related protein (residues 35-46 of SEQ ID NO:2), or a homologue
of said molecule.

17. The machine readable storage medium of Claim 16 wherein said molecule is a
melanocortin receptor agonist.

18. The machine readable storage medium of Claim 16 wherein said molecule is a
melanocortin receptor antagonist.

19. The machine-readable data storage medium according to Claim 16 wherein said
molecule is defined by the set of structure coordinates depicted in Appendix 1, or a homologue
of said molecule, said homologue having a root mean square deviation from the backbone atoms
of said amino acids of not more than 2.54 Å.

20. A machine-readable data storage medium comprising a data storage material encoded
with a first set of machine readable data which, when combined with a second set of machine
readable data, using a machine programmed with instructions for using said first set of data and
said second set of data, can determine at least a portion of the structure coordinates
corresponding to the second set of machine readable data, wherein: said first set of data
comprises a Fourier transform of at least a portion of the structural coordinates selected from the
group consisting of coordinates depicted in Appendix 1; and said second set of data comprises
an X-ray diffraction pattern of a molecule.

21. An NMR structure of the minimized agouti related protein, embodied in a computer
readable media.

22. A compound identified according to the method of Claim 1.

23. A polypeptide comprising the amino acid sequence:

CVRLHESCLGQQVPCCDPAATCYCRFFNAFCYC (SEQ ID NO:3)
or a modified form thereof.

24. The polypeptide of Claim 23 wherein the polypeptide is chemically synthesized.

25. A method of treating a disease state in mammals that is alleviated by treatment with a
polypeptide having an amino acid sequence:

CVRLHESCLGQQVPCCDPAATCYCRFFNAFCYC (SEQ ID NO:3)
which method comprises administering to a mammal in need of such a treatment a
therapeutically effective amount of said polypeptide, or a pharmaceutically acceptable salt
thereof.

26. The method of Claim 25 wherein said disease state is a wasting syndrome.

27. A pharmaceutical composition comprising a therapeutically effective amount of a
polypeptide of the sequence:

CVRLHESCLGQQVPCCDPAATCYCRFFNAFCYC (SEQ ID NO:3)

or a pharmaceutically acceptable salt thereof.

SEQUENCE LISTING

<110> The Regents of the University of California

<120> METHODS AND COMPOUNDS FOR MODULATING MELANOCORTIN
RECEPTOR-LIGAND BINDING

<130> UCAL-257/00WO

<140>
<141>

<160> 3

<170> PatentIn Ver. 2.0

<210> 1
<211> 132
<212> PRT
<213> Homo sapiens

<400> 1
Met Leu Thr Ala Ala Leu Leu Ser Cys Ala Leu Leu Leu Ala Leu Pro
  1               5                  10                  15

Ala Thr Arg Gly Ala Gln Met Gly Leu Ala Pro Met Glu Gly Ile Arg
             20                  25                  30

Arg Pro Asp Gln Ala Leu Leu Pro Glu Leu Pro Gly Leu Gly Leu Arg
         35                  40                  45

Ala Pro Leu Lys Lys Thr Thr Ala Glu Gln Ala Glu Glu Asp Leu Leu
     50                  55                  60

Gln Glu Ala Gln Ala Leu Ala Glu Val Leu Asp Leu Gln Asp Arg Glu
 65                  70                  75                  80

Pro Arg Ser Ser Arg Arg Cys Val Arg Leu His Glu Ser Cys Leu Gly
                 85                  90                  95

Gln Gln Val Pro Cys Cys Asp Pro Cys Ala Thr Cys Tyr Cys Arg Phe
            100                 105                 110

Phe Asn Ala Phe Cys Tyr Cys Arg Lys Leu Gly Thr Ala Met Asn Pro
        115                 120                 125

Cys Ser Arg Thr
    130

<210> 2
<211> 46
<212> PRT
<213> Homo sapiens

<400> 2
Cys Val Arg Leu His Glu Ser Cys Leu Gly Gln Gln Val Pro Cys Cys
  1               5                  10                  15

Asp Pro Cys Ala Thr Cys Tyr Cys Arg Phe Phe Asn Ala Phe Cys Tyr
             20                  25                  30

Cys Arg Lys Leu Gly Thr Ala Met Asn Pro Cys Ser Arg Thr
         35                  40                  45

<210> 3
<211> 33
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic

<400> 3
Cys Val Arg Leu His Glu Ser Cys Leu Gly Gln Gln Val Pro Cys Cys
  1               5                  10                  15

Asp Pro Ala Ala Thr Cys Tyr Cys Arg Phe Phe Asn Ala Phe Cys Tyr
             20                  25                  30

Cys
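
By way of illustration only, the three-letter listing of SEQ ID NO:3 can be converted to one-letter code and compared with the sequence recited in Claims 23, 25 and 27. The following is a minimal sketch; the function and variable names are hypothetical, and the mapping is the standard table for the twenty common amino acids.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# SEQ ID NO:3 as given in the sequence listing (position numbers omitted).
SEQ_ID_NO_3 = """
Cys Val Arg Leu His Glu Ser Cys Leu Gly Gln Gln Val Pro Cys Cys
Asp Pro Ala Ala Thr Cys Tyr Cys Arg Phe Phe Asn Ala Phe Cys Tyr
Cys
"""

# One-letter sequence as recited in the claims.
CLAIMED = "CVRLHESCLGQQVPCCDPAATCYCRFFNAFCYC"


def to_one_letter(three_letter_sequence):
    """Convert a whitespace-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[code] for code in three_letter_sequence.split())


if __name__ == "__main__":
    converted = to_one_letter(SEQ_ID_NO_3)
    print(converted, len(converted), converted == CLAIMED)
```

A residue code outside the table raises a KeyError, which is a convenient way of catching a mistranscribed entry in a listing.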